Abstract

In this paper, a dual-hop communication system composed of a source $\mathcal{S}$ and a destination $\mathcal{D}$ connected through two non-interfering half-duplex relays, $\mathcal{R}_1$ and $\mathcal{R}_2$, is considered. In the information theory literature, this configuration is known as the diamond channel. In this setup, four transmission modes are present, namely: 1) $\mathcal{S}$ transmits, and $\mathcal{R}_1$ and $\mathcal{R}_2$ listen (broadcast mode); 2) $\mathcal{S}$ transmits, $\mathcal{R}_1$ listens, and simultaneously $\mathcal{R}_2$ transmits and $\mathcal{D}$ listens; 3) $\mathcal{S}$ transmits, $\mathcal{R}_2$ listens, and simultaneously $\mathcal{R}_1$ transmits and $\mathcal{D}$ listens; 4) $\mathcal{R}_1$ and $\mathcal{R}_2$ transmit, and $\mathcal{D}$ listens (multiple-access mode). Assuming a constant power constraint for all transmitters, a parameter $\Delta$ is defined, which captures some important features of the channel. It is proven that for $\Delta = 0$ the capacity of the channel can be attained by successive relaying, i.e., using modes 2 and 3 defined above in a successive manner. This strategy may have an infinite gap from the capacity of the channel when $\Delta \neq 0$. To achieve rates as close as 0.71 bits to the capacity, it is shown that the cases of $\Delta > 0$ and $\Delta < 0$ should be treated differently. Using new upper bounds based on the dual problem of the linear program associated with the cut-set bounds, it is proven that the successive relaying strategy needs to be enhanced by an additional broadcast mode (mode 1), or multiple-access mode (mode 4), for the cases of $\Delta < 0$ and $\Delta > 0$, respectively. Furthermore, it is established that under average power constraints the aforementioned strategies achieve rates as close as 3.6 bits to the capacity of the channel.



Index Terms 

Capacity, decode-and-forward, diamond channel, dual problem, gap analysis, half-duplex, linear program. 

I. Introduction 

A. Motivation 

Relay-aided wireless systems, also called multi-hop systems, are implemented to increase the coverage and the throughput of communication systems [1]. These systems are becoming important parts of developing wireless communication standards, such as IEEE 802.16j (also known as WiMAX) [2]. Half-duplex relays, which transmit and receive data at different times and/or frequencies, are proven to be more practical and cost-efficient in such standards than full-duplex relays.

From an information-theoretic point of view, the capacity becomes larger when more relays are added to the system. However, designing optimum strategies, especially in half-duplex systems, is challenging because subtle scheduling, i.e., timing among transmission modes, is required to achieve rates near the capacity of such systems. During the last decade, the mainstream of research dealt with single-relay communication systems (cf. [5] and references therein). A simple model for investigating the potential benefits of a system with multiple relays is a dual-hop configuration with two parallel half-duplex relays (see Fig. 1). This configuration does not cover all two-relay systems because there are no source-destination and relay-relay links. However, it captures the basic difficulty in finding the best strategy in the system. As will be shown in this paper, a single strategy falls short of achieving rates near the capacity of the system for all channel realizations.

This work is financially supported by Nortel Networks and the corresponding matching funds by the Natural Sciences and Engineering Research Council of Canada (NSERC), and Ontario Centers of Excellence (OCE).

B. History

The single relay channel in which the relay facilitates a point-to-point communication was first studied in [3]. Two important coding techniques, decode-and-forward and compress-and-forward, were proposed in [4]. In the decode-and-forward scheme, the relay decodes the received message. In the compress-and-forward scheme, the relay sends the compressed (quantized) version of the received data to the destination. Following [4], generalizations to multi-relay networks were investigated by several researchers. A comprehensive survey of the progress in this area can be found in [5].

A simple model for understanding some aspects of the multi-relay networks is a network with two parallel relays, 
as introduced in [6], [7], and Fig. 1. It is assumed that there are no direct links from the source to the destination 
and also between the relays. This channel is studied in [8]-[16] and [21], and referred to as the diamond relay 
channel in [12]. 

For full-duplex relays, Schein and Gallager, in [6] and [7], provided upper and lower bounds on the capacity of 
the diamond channel. In particular, they considered the amplify-and-forward, and the decode-and-forward schemes, 
as well as a hybrid of them based on the time-sharing principle. Kochman, et al. proposed a rematch-and-forward 
scheme when different fractions of bandwidth can be allotted to the first and second hops [8]. Rezaei, et al. 
suggested a combined amplify-and-decode-forward strategy and proved that their scheme always performs better 
than the rematch-and-forward scheme [9]. In addition, they showed that the time-sharing between the combined 
amplify-and-decode-forward and decode-and-forward schemes provides a better achievable rate when compared 
to the time-sharing between the amplify-and-forward and decode-and-forward, and also between the rematch-and- 
forward and decode-and-forward, considered in [7], and [8], respectively. Kang and Ulukus employed a combination 
of the decode-and-forward and compress-and-forward schemes to obtain the capacity of a special class of the 
diamond channel with a noiseless relay [10]. Ghabeli and Aref in [11] proposed a new achievable rate based on 
the generalized block Markov encoding [23]. They also showed that their scheme achieves the capacity of a class 
of deterministic relay networks. 

Half-duplex relays are studied in [12]-[18]. Xue and Sandhu in [12] proposed several schemes including the 
multi-hop with spatial reuse, scale-forward, broadcast-multiaccess with common message, compress-and-forward, 
and hybrid methods. These authors demonstrated that the multi-hop with spatial reuse protocol can achieve the 
channel capacity if the parallel links have the same capacity. Unlike [6]-[10], [12], [21], which assumed no direct
link exists between the relays, [13]-[15] considered such a link. More specifically, Chang, et al. proposed a combined
dirty paper coding and block Markov encoding scheme [14]. Using numerical examples, they showed that the gap 
between their proposed strategy and the upper bound is relatively small in most cases. Rezaei, et al. considered two 
scheduling algorithms, namely successive and simultaneous relaying [15]. They derived asymptotic capacity results 
for the successive relaying and also proposed an achievable rate for the simultaneous relaying using a combination 
of the amplify-and-forward and decode-and-forward schemes. Other related papers are [17]-[20]. 

Characterizing the capacity of an information theoretic channel may be difficult. A simpler, yet important approach 
is to find an achievable scheme that ensures a small gap from the capacity of the channel. Recently, Etkin et al. 
characterized the capacity region of the interference channel to within one bit [26]. Following this new capacity 
analysis perspective, Avestimehr et al. proposed a deterministic model to better analyze the general single-source 
single-destination and the single-source multi-destination Gaussian networks [16], [21]. Their quantize-and-map
achievability scheme is guaranteed to provide a rate that is within a constant number of bits (determined by the
graph topology of the network) from the cut-set upper bound. 

C. Relation to Previous Works 

In this paper, the setup and assumptions used in [12], with no link between the relays, are followed. In [12], the multi-hop with spatial reuse scheme was proved to achieve the capacity of the diamond channel if the capacities of the parallel links in Fig. 1 are equal. This is called the Multi-hopping Decode-and-Forward (MDF) scheme. In the MDF scheme, relays successively forward their decoded messages to the destination (see Forward Modes I and II in Fig. 2). By introducing a fundamental parameter of the channel $\Delta$ (see Fig. 1), we generalize the optimality condition of the MDF scheme. In particular, we show that whenever $\Delta = 0$, the cut-set upper bound can be achieved. We also show that the MDF scheme cannot have a small gap from the cut-set upper bound for all channel realizations because the optimum strategy is highly related to the value of $\Delta$.

In [16], the aim has been to establish the constant gap argument for the general relay networks with a single source and not to obtain a small gap optimized for a specific channel, such as the diamond channel. For the half-duplex diamond channel, the expression for the gap derived in [16] results in a 6-bit gap. In this paper, however, we focus on the diamond channel and obtain a smaller gap using our proposed achievability scheme. In addition, we provide closed-form expressions for the time intervals associated with the transmission modes in the proposed scheduling. Specifically, we show that the expressions are different from those of the cut-set upper bound. This is in contrast to [16], where the constant gap between the cut-set bound and the quantize-and-map scheme was assured for every fixed scheduling, including the optimum scheduling associated with the cut-set upper bound.

Fig. 1. The diamond channel with its fundamental parameter $\Delta = C_{01}C_{02} - C_{13}C_{23}$.

In [16], using an achievability scheme different from quantize-and-map (a partial decode-and-forward scheme), Avestimehr et al. showed that the capacity of the full-duplex diamond channel can be characterized within 1 bit per real dimension, regardless of the values of the channel gains. However, applying this scheme to the half-duplex diamond channel does not guarantee a constant gap from the channel capacity. We take one further step by providing an achievable scheme that ensures a small gap from the upper bounds for the half-duplex diamond channel. In particular, we show that the gap is smaller than 0.71 bits, assuming all transmitters have constant power constraints. We also prove that when transmitters have average power constraints instead, the gap is less than 3.6 bits.

The rest of this paper is organized as follows: Section II introduces the system model, the main ideas and results of this work. Section III presents the MDF scheme, which achieves the channel capacity for $\Delta = 0$. Sections IV and V provide the achievable schemes, upper bounds, and gap analysis for the $\Delta < 0$ and $\Delta > 0$ cases, respectively. Section VI concludes the paper. In addition, Appendix A characterizes the Generalized Degrees Of Freedom (GDOF) of the diamond channel to obtain the asymptotic capacity of the channel. Finally, Appendix B addresses the diamond channel with average power constraints.



D. Notations 

Throughout the paper, $\bar{x} = 1 - x$, and $x^*$ denotes the optimal solution to an optimization problem with an objective function $F(x)$. The transpose of the vector or matrix $A$ is indicated by $A^T$. $a \to b$ represents the link from node $a$ to node $b$. Also, $x \leftrightarrow y$ means that the roles of $x$ and $y$ are exchanged in a given function $F(x, y)$. In addition, it is assumed that all logarithms are to base 2. Finally, $C(P) = \frac{1}{2}\log(1 + P)$.

II. Problem Statement and Main Results 

In this work, a dual-hop communication system, depicted in Fig. 1, is considered. The model consists of a source ($\mathcal{S}$), two parallel half-duplex relays ($\mathcal{R}_1$, $\mathcal{R}_2$), and a destination ($\mathcal{D}$), respectively indexed by 0, 1, 2, and 3 as shown in Fig. 1. No link is assumed between Source and Destination, or between the relays. The channel gain between nodes $a$ and $b$ is assumed to be constant, known to all nodes, and is represented by $h_{ab}$ with magnitude $\sqrt{g_{ab}}$.

Due to the half-duplex constraint, four transmission modes exist in the diamond channel where, in every mode, each relay either transmits data to Destination or receives data from Source (see Fig. 2). In the figure, $X_a^{(i)}$ and $Y_a^{(i)}$ represent the transmitting and receiving signals at node $a$ corresponding to mode $i$, respectively.

Fig. 2. Transmission modes for the diamond channel.
a) Mode 1 (Broadcast Mode) with duration $t_1$: the source transmits the vector $x_0^{(1)}$; the first and second relays receive $y_1^{(1)}$ and $y_2^{(1)}$, respectively.
b) Mode 2 (Forward Mode I) with duration $t_2$: the source and the second relay transmit the vectors $x_0^{(2)}$ and $x_2^{(2)}$; the first relay and the destination receive $y_1^{(2)}$ and $y_3^{(2)}$, respectively.
c) Mode 3 (Forward Mode II) with duration $t_3$: the source and the first relay transmit the vectors $x_0^{(3)}$ and $x_1^{(3)}$; the second relay and the destination receive $y_2^{(3)}$ and $y_3^{(3)}$, respectively.
d) Mode 4 (Multiple-Access Mode) with duration $t_4$: the relays transmit the vectors $x_1^{(4)}$ and $x_2^{(4)}$; the destination receives $y_3^{(4)}$.

The total transmission time is normalized to one and partitioned into four time intervals $(t_1, t_2, t_3, t_4)$ corresponding to modes 1, 2, 3, and 4, with the constraint $\sum_{i=1}^{4} t_i = 1$. The discrete-time baseband representation of the received signals at Relay 1, Relay 2, and Destination are respectively given by:

$$\begin{aligned} Y_1 &= h_{01} X_0 + N_1, \\ Y_2 &= h_{02} X_0 + N_2, \\ Y_3 &= h_{13} X_1 + h_{23} X_2 + N_3, \end{aligned}$$

where $N_a$ is the Gaussian noise at node $a$ with unit variance.

Let Source, Relay 1, and Relay 2 consume, respectively, $P_S^{(i)}$, $P_{\mathcal{R}_1}^{(i)}$, and $P_{\mathcal{R}_2}^{(i)}$ amount of power in mode $i$. The total power constraints for Source, Relay 1, and Relay 2 are $P_S$, $P_{\mathcal{R}_1}$, and $P_{\mathcal{R}_2}$, respectively, and are related to the amount of power spent in each mode through sums over the four modes $i = 1, \dots, 4$.

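Due to some practical considerations on the power constraints [12], we mainly consider constant power constraints for transmitters, i.e., for $i \in \{1, \dots, 4\}$,

$$\begin{aligned} P_S^{(i)} &= P_S, \\ P_{\mathcal{R}_1}^{(i)} &= P_{\mathcal{R}_1}, \\ P_{\mathcal{R}_2}^{(i)} &= P_{\mathcal{R}_2}. \end{aligned} \tag{1}$$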

Without loss of generality, a unit power constraint is considered for all nodes, i.e., $P_S = P_{\mathcal{R}_1} = P_{\mathcal{R}_2} = 1$. We define the parameters $C_{01}$, $C_{02}$, $C_{13}$, $C_{23}$ as $C(g_{01})$, $C(g_{02})$, $C(g_{13})$, $C(g_{23})$, respectively. Moreover, $C_{012}$ and $C_{123}$ are defined as:

$$\begin{aligned} C_{012} &= C(g_{01} + g_{02}), \\ C_{123} &= C\big((\sqrt{g_{13}} + \sqrt{g_{23}})^2\big). \end{aligned} \tag{2}$$

The case in which transmitters have average power constraints instead of constant power constraints is addressed in Appendix B.

In this work, we are interested in finding communication protocols that operate close to the channel capacity. We introduce an important parameter of the channel $\Delta$ as:

$$\Delta = C_{01}C_{02} - C_{13}C_{23}. \tag{3}$$

We categorize all realizations of the diamond channel into three groups based on the sign of $\Delta$ (i.e., $\Delta < 0$, $\Delta = 0$, and $\Delta > 0$). As will be shown in the sequel, the sign of $\Delta$ plays an important role in designing the optimum scheduling for the channel.
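
As an illustration of the definitions above, the following minimal sketch computes the link capacities and the sign of $\Delta$ from the channel gains, assuming unit transmit power and unit noise variance as stated in the text. The function names (`link_capacities`, `delta`, `regime`) and the tolerance `tol` are hypothetical and introduced here only for this example.

```python
import math


def C(p):
    """C(P) = 0.5 * log2(1 + P), the capacity function used in the paper."""
    return 0.5 * math.log2(1.0 + p)


def link_capacities(g01, g02, g13, g23):
    """Capacities of the four links and of the broadcast / multiple-access cuts."""
    return {
        "C01": C(g01),
        "C02": C(g02),
        "C13": C(g13),
        "C23": C(g23),
        "C012": C(g01 + g02),
        "C123": C((math.sqrt(g13) + math.sqrt(g23)) ** 2),
    }


def delta(caps):
    """Delta = C01*C02 - C13*C23, as in Eq. (3)."""
    return caps["C01"] * caps["C02"] - caps["C13"] * caps["C23"]


def regime(caps, tol=1e-12):
    """Classify a channel realization by the sign of Delta (tol is an assumption)."""
    d = delta(caps)
    if abs(d) <= tol:
        return "zero"
    return "negative" if d < 0 else "positive"


if __name__ == "__main__":
    caps = link_capacities(g01=3, g02=3, g13=15, g23=15)
    print(caps["C01"], caps["C13"], delta(caps), regime(caps))
```

With gains chosen as powers of two minus one, the capacities are round numbers, which makes it easy to build test channels for each of the three groups.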

In this setup, the cut-set bounds can be stated in the form of a Linear Program (LP) due to the assumption of constant power constraints for all transmitters. By analyzing the dual program we provide fairly tight upper bounds expressed as single equations corresponding to different channel conditions. Using the dual problem, we prove that when $\Delta = 0$, the MDF scheme achieves the capacity of the diamond channel. Note that $\Delta = 0$ (i.e., $C_{01}C_{02} = C_{13}C_{23}$) includes the previous optimality condition presented in [12] (i.e., $C_{01} = C_{23}$ and $C_{02} = C_{13}$) as a special case. To see how close the MDF scheme performs to the capacity of the channel when $\Delta \neq 0$, we calculate the gap from the upper bounds. We show that the MDF scheme provides a gap of less than 1.21 bits when applied in the symmetric or some classes of asymmetric diamond channels. More importantly, we explain that the gap can be arbitrarily large for certain ranges of parameters.

By employing new scheduling algorithms we shrink the gap to 0.71 bits for all channel conditions. In particular, for $\Delta < 0$ we add Broadcast (BC) Mode (shown in Fig. 2) to the MDF scheme to provide the relays with more reception time. In this three-mode scheme, referred to as the Multi-hopping Decode-and-Forward with Broadcast (MDF-BC) scheme, the relays decode what they have received from Source and forward the re-encoded information to Destination in Forward Modes I and II. When $\Delta > 0$, Multiple-Access (MAC) Mode (shown in Fig. 2), in which the relays transmit independent information to Destination, is added to the MDF scheme. We call this protocol the Multi-hopping Decode-and-Forward with Multiple-Access (MDF-MAC) scheme.

The mentioned contributions are associated with the case wherein the transmitters operate under the constant power constraints (1). However, for a more general setting in which the transmitters are subject to average power constraints, it is shown in Appendix B that the cut-set upper bounds are increased by at most 2.89 bits. Therefore, the proposed achievable schemes guarantee a maximum gap of 3.6 bits from the cut-set upper bounds in the general average power constraint setting.

A. Coding Scheme 

The proposed achievable scheme may employ all four transmission modes as follows: 

1) Broadcast Mode: In $t_1$ fraction of the transmission time, Source broadcasts independent information to Relays 1 and 2 using the superposition coding technique.

2) Forward Mode I: In $t_2$ fraction of the transmission time, Source transmits new information to Relay 1. At the same time, Relay 2 sends the re-encoded version of part of the data received during Broadcast Mode and/or Forward Mode II of the previous block to Destination.

3) Forward Mode II: In $t_3$ fraction of the transmission time, Source transmits new information to Relay 2. At the same time, Relay 1 sends the re-encoded version of part of what it has received during Broadcast Mode and/or Forward Mode I of the previous block to Destination.

4) Multiple-Access Mode: In the remaining $t_4$ fraction of the transmission time, Relays 1 and 2 simultaneously transmit the residual information (corresponding to the previous block) to Destination, where joint decoding is performed to decode the received data.

In Broadcast Mode, superposition coding, which is known to be the optimal transmission scheme for the degraded broadcast channel [28], is used to transmit independent data to the relays. The resulting data-rates $u$ and $v$, respectively associated with Relay 1 and Relay 2, are:

$$u = \begin{cases} C_{01} - C(\eta g_{01}) & \text{if } g_{01} \le g_{02}, \\ C(\eta g_{01}) & \text{if } g_{02} < g_{01}, \end{cases} \tag{4}$$

$$v = \begin{cases} C(\eta g_{02}) & \text{if } g_{01} \le g_{02}, \\ C_{02} - C(\eta g_{02}) & \text{if } g_{02} < g_{01}. \end{cases} \tag{5}$$

The power allocation parameter $\eta$ determines the amount of Source power used to transmit information to the relay with better channel quality in Broadcast Mode.
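
The two rate expressions in Eqs. (4) and (5) can be evaluated numerically. The sketch below is only illustrative: it assumes unit source power and unit noise variance, and the function name `broadcast_rates` is hypothetical.

```python
import math


def C(p):
    """C(P) = 0.5 * log2(1 + P)."""
    return 0.5 * math.log2(1.0 + p)


def broadcast_rates(eta, g01, g02):
    """Superposition-coding rates (u, v) for Relay 1 and Relay 2 in Broadcast Mode.

    eta is the fraction of the source power given to the relay with the
    better channel; the weaker relay treats that part as noise.
    """
    if not 0.0 <= eta <= 1.0:
        raise ValueError("eta must lie in [0, 1]")
    if g01 <= g02:
        # Relay 2 is the stronger user.
        u = C(g01) - C(eta * g01)
        v = C(eta * g02)
    else:
        # Relay 1 is the stronger user.
        u = C(eta * g01)
        v = C(g02) - C(eta * g02)
    return u, v


if __name__ == "__main__":
    for eta in (0.0, 0.25, 1.0):
        print(eta, broadcast_rates(eta, g01=3, g02=15))
```

Setting `eta = 0` gives all the power to the weaker relay and `eta = 1` gives all of it to the stronger relay, so the two end points recover the individual link capacities.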

In Multiple-Access Mode, a multiple-access channel exists in which the users (relays) have independent messages for Destination. For this channel, joint decoding is optimum, which provides the following rate region [28]:

$$\begin{aligned} R_1 &\le t_4 C_{13}, \\ R_2 &\le t_4 C_{23}, \\ R_1 + R_2 &\le t_4 C_{\text{MAC}}, \end{aligned} \tag{6}$$

where $R_1$ and $R_2$ are the rates that Relay 1 and Relay 2 provide to Destination in Multiple-Access Mode, respectively, and $C_{\text{MAC}}$ is defined as:

$$C_{\text{MAC}} = C(g_{13} + g_{23}). \tag{7}$$

According to the protocol, Relay 1 can receive up to $t_1 u + t_2 C_{01}$ bits per channel use during Broadcast Mode and Forward Mode I. Then the relay has the opportunity to send its received information to Destination in Forward Mode II and Multiple-Access Mode, with the rate $t_3 C_{13} + R_1$. Similarly, Relay 2 can receive and forward messages with the rates $t_1 v + t_3 C_{02}$ and $t_2 C_{23} + R_2$, respectively. Therefore, the maximum achievable rate of the scheme, $R$, is:

$$R = \max_{\sum_{i=1}^{4} t_i = 1,\; t_i \ge 0} \Big\{ \min\{t_1 u + t_2 C_{01},\; t_3 C_{13} + R_1\} + \min\{t_1 v + t_3 C_{02},\; t_2 C_{23} + R_2\} \Big\}. \tag{8}$$

Sections III-V show that employing Forward Modes I and II for $\Delta = 0$, the first three transmission modes for $\Delta < 0$, and the last three transmission modes for $\Delta > 0$ are sufficient to achieve a small gap from the derived upper bounds.
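
To make the bookkeeping behind Eq. (8) concrete, the following sketch evaluates the expression inside the maximization for one fixed schedule. It is a hedged illustration rather than the paper's procedure: the function name `scheme_rate`, the dictionary keys, and the tolerance `tol` are assumptions, and the caller supplies `u`, `v`, `r1`, and `r2` directly.

```python
def scheme_rate(t, u, v, r1, r2, caps, tol=1e-9):
    """Rate of the four-mode scheme for a fixed schedule t = (t1, t2, t3, t4).

    Each branch carries the minimum of what its relay can receive and what
    it can forward; (r1, r2) must lie in the multiple-access region (6).
    """
    t1, t2, t3, t4 = t
    if min(t) < -tol or abs(sum(t) - 1.0) > tol:
        raise ValueError("t must be non-negative and sum to one")
    in_mac_region = (
        -tol <= r1 <= t4 * caps["C13"] + tol
        and -tol <= r2 <= t4 * caps["C23"] + tol
        and r1 + r2 <= t4 * caps["CMAC"] + tol
    )
    if not in_mac_region:
        raise ValueError("(r1, r2) is outside the multiple-access region")
    branch1 = min(t1 * u + t2 * caps["C01"], t3 * caps["C13"] + r1)
    branch2 = min(t1 * v + t3 * caps["C02"], t2 * caps["C23"] + r2)
    return branch1 + branch2


if __name__ == "__main__":
    # Illustrative capacities (not taken from the paper).
    caps = {"C01": 1.0, "C02": 2.0, "C13": 2.0, "C23": 1.0, "CMAC": 2.2}
    # Only Forward Modes I and II are used here (t1 = t4 = 0).
    print(scheme_rate((0.0, 2 / 3, 1 / 3, 0.0), 0.0, 0.0, 0.0, 0.0, caps))
```

Maximizing this function over the schedule and the rate split gives $R$ in Eq. (8); the sketch deliberately stops short of the optimization.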

B. Cut-set Upper Bound and the Dual Program

For general half-duplex networks with K relays, Khojastepour et al. proposed a cut-set type upper bound by 
doing the following steps: 

1) Fix the input distribution and scheduling, i.e., $p(X_0, X_1, X_2)$, and $t_1, t_2, t_3, t_4$ such that $\sum_{i=1}^{4} t_i = 1$.

2) Find the rate $R_{i,j}$ associated with the cut $j$ for each transmission mode $i$, where $j \in \{1, \dots, 2^K\}$.

3) Multiply $R_{i,j}$ by the corresponding time interval $t_i$.

4) Compute $\sum_{i} t_i R_{i,j}$ and minimize it over all cuts.

5) Take the supremum over all input distributions and schedulings. 

The preceding procedure can be directly applied to the diamond channel, whose transmission modes are shown in 
Fig. 2. The best input distribution and scheduling lead to: 



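$$\begin{aligned}
C_{DC} &\le t_1 I\big(X_0^{(1)}; Y_1^{(1)}, Y_2^{(1)}\big) + t_2 I\big(X_0^{(2)}; Y_1^{(2)}\big) + t_3 I\big(X_0^{(3)}; Y_2^{(3)}\big) + t_4 \cdot 0, \\
C_{DC} &\le t_1 I\big(X_0^{(1)}; Y_1^{(1)}\big) + t_2 \Big[ I\big(X_0^{(2)}; Y_1^{(2)}\big) + I\big(X_2^{(2)}; Y_3^{(2)}\big) \Big] + t_3 \cdot 0 + t_4 I\big(X_2^{(4)}; Y_3^{(4)} \,\big|\, X_1^{(4)}\big), \\
C_{DC} &\le t_1 I\big(X_0^{(1)}; Y_2^{(1)}\big) + t_2 \cdot 0 + t_3 \Big[ I\big(X_0^{(3)}; Y_2^{(3)}\big) + I\big(X_1^{(3)}; Y_3^{(3)}\big) \Big] + t_4 I\big(X_1^{(4)}; Y_3^{(4)} \,\big|\, X_2^{(4)}\big), \\
C_{DC} &\le t_1 \cdot 0 + t_2 I\big(X_2^{(2)}; Y_3^{(2)}\big) + t_3 I\big(X_1^{(3)}; Y_3^{(3)}\big) + t_4 I\big(X_1^{(4)}, X_2^{(4)}; Y_3^{(4)}\big),
\end{aligned}$$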

where $C_{DC}$ denotes the capacity of the diamond channel. The above bounds do not decrease if each mutual information term is replaced by its maximum value. This substitution simplifies the computation of the upper bound, called $R_{\text{up}}$, by providing the following LP [12]:

$$\begin{aligned}
\text{maximize} \quad & R_{\text{up}} \\
\text{subject to:} \quad & R_{\text{up}} \le t_1 C_{012} + t_2 C_{01} + t_3 C_{02} + t_4 \cdot 0 \\
& R_{\text{up}} \le t_1 C_{01} + t_2 (C_{01} + C_{23}) + t_3 \cdot 0 + t_4 C_{23} \\
& R_{\text{up}} \le t_1 C_{02} + t_2 \cdot 0 + t_3 (C_{02} + C_{13}) + t_4 C_{13} \\
& R_{\text{up}} \le t_1 \cdot 0 + t_2 C_{23} + t_3 C_{13} + t_4 C_{123} \\
& \textstyle\sum_{i=1}^{4} t_i = 1, \quad t_i \ge 0.
\end{aligned} \tag{9}$$

To obtain appropriate single-equation upper bounds on the capacity, we rely on the fact that every feasible point in the dual program provides an upper bound on the primal. Hence, we develop the desired upper bounds by looking at the dual program. In the sequel, we derive the dual program for the LP (9). We start with writing the LP in the standard form as:

$$\begin{aligned}
\text{maximize} \quad & c^T x \\
\text{subject to:} \quad & A x \le b \\
& x \ge 0,
\end{aligned}$$

where the unknown vector $x = [t_1, t_2, t_3, t_4, R_{\text{up}}]^T$, the vectors of coefficients $b = c = [0, 0, 0, 0, 1]^T$, and the matrix of coefficients $A$ is:

$$A = \begin{pmatrix}
-C_{012} & -C_{01} & -C_{02} & 0 & 1 \\
-C_{01} & -(C_{01} + C_{23}) & 0 & -C_{23} & 1 \\
-C_{02} & 0 & -(C_{02} + C_{13}) & -C_{13} & 1 \\
0 & -C_{23} & -C_{13} & -C_{123} & 1 \\
1 & 1 & 1 & 1 & 0
\end{pmatrix}.$$



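Since $A = A^T$, it is easy to verify that the primal and dual programs share the same form, i.e.,

$$\begin{aligned}
\text{minimize} \quad & R_{\text{up}} \\
\text{subject to:} \quad & R_{\text{up}} \ge \tau_1 C_{012} + \tau_2 C_{01} + \tau_3 C_{02} + \tau_4 \cdot 0 \\
& R_{\text{up}} \ge \tau_1 C_{01} + \tau_2 (C_{01} + C_{23}) + \tau_3 \cdot 0 + \tau_4 C_{23} \\
& R_{\text{up}} \ge \tau_1 C_{02} + \tau_2 \cdot 0 + \tau_3 (C_{02} + C_{13}) + \tau_4 C_{13} \\
& R_{\text{up}} \ge \tau_1 \cdot 0 + \tau_2 C_{23} + \tau_3 C_{13} + \tau_4 C_{123} \\
& \textstyle\sum_{i=1}^{4} \tau_i = 1, \quad \tau_i \ge 0.
\end{aligned} \tag{10}$$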



In the dual program (10), $\tau_i$ for $i \in \{1, \dots, 4\}$ corresponds to the $i$th rate constraint in the primal LP (9). Clearly, the LP (9) is feasible. Hence, the duality of linear programming ensures that there is no gap between the primal and the dual solutions [27]. However, the benefit of using the dual problem here is that any feasible choice of the vector $\tau$ provides an upper bound to the rate obtained by solving the original LP. This property is known as the weak duality property of LP [27]. Appropriate vectors (i.e., $\tau$'s) in the dual program (10) are selected to obtain fairly tight upper bounds. In fact, employing such vectors instead of solving the primal LP (9) simplifies the gap analysis. In Sections IV and V these vectors are provided for the $\Delta < 0$ and $\Delta > 0$ cases, respectively. In the following sections, we employ the proposed achievable schemes together with the derived upper bounds to characterize the capacity of the diamond channel up to 0.71 bits.
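
The weak duality property used above can be checked numerically without an LP solver. In the sketch below, any schedule $t$ on the simplex gives a primal value (the minimum of the four cut expressions), and any weight vector $\tau$ on the simplex gives a dual bound (the maximum of the same four expressions). The helper names and the capacity values are hypothetical, chosen only for illustration.

```python
import random


def cut_values(w, caps):
    """The four right-hand sides shared by the primal LP (9) and the dual (10)."""
    w1, w2, w3, w4 = w
    return [
        w1 * caps["C012"] + w2 * caps["C01"] + w3 * caps["C02"],
        w1 * caps["C01"] + w2 * (caps["C01"] + caps["C23"]) + w4 * caps["C23"],
        w1 * caps["C02"] + w3 * (caps["C02"] + caps["C13"]) + w4 * caps["C13"],
        w2 * caps["C23"] + w3 * caps["C13"] + w4 * caps["C123"],
    ]


def primal_value(t, caps):
    """Largest R_up allowed by schedule t: the tightest of the four cuts."""
    return min(cut_values(t, caps))


def dual_bound(tau, caps):
    """Smallest R_up allowed by dual vector tau: an upper bound on the LP value."""
    return max(cut_values(tau, caps))


def random_simplex_point(rng, n=4):
    """Draw a non-negative vector that sums to one (not uniform; enough for a check)."""
    draws = [rng.random() + 1e-12 for _ in range(n)]
    total = sum(draws)
    return [d / total for d in draws]


if __name__ == "__main__":
    caps = {"C01": 1.0, "C02": 2.0, "C13": 1.5, "C23": 0.5, "C012": 2.2, "C123": 1.9}
    rng = random.Random(0)
    for _ in range(5):
        t = random_simplex_point(rng)
        tau = random_simplex_point(rng)
        print(round(primal_value(t, caps), 4), "<=", round(dual_bound(tau, caps), 4))
```

Because the coefficient matrix is symmetric, one function serves both programs; this is the practical consequence of the observation that the primal and dual share the same form.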

III. MDF Scheme and Achieving the Capacity for $\Delta = 0$

In this section, the MDF scheme is described and then proved to be capacity-achieving when $\Delta = 0$.



A. MDF Scheme 

The MDF scheduling algorithm uses two transmission modes, Forward Modes I and II shown in Fig. 2, along with the decode-and-forward strategy and can be described as follows:

1) In $\lambda$ fraction of the transmission time, Source and Relay 2 transmit to Relay 1 and Destination, respectively.

2) In the remaining $\bar{\lambda}$ fraction of the transmission time, Source and Relay 1 transmit to Relay 2 and Destination, respectively.

The achievable rate of the MDF scheme is the summation of the rates of the first and second parallel paths (branches) from Source to Destination, which can be expressed as [12]:

$$R_{\text{MDF}} = \max_{0 \le \lambda \le 1} \Big\{ \min\{\lambda C_{01},\; \bar{\lambda} C_{13}\} + \min\{\bar{\lambda} C_{02},\; \lambda C_{23}\} \Big\}.$$

The above maximization can be rewritten as the following LP:

$$\begin{aligned}
\text{maximize} \quad & R_1 + R_2 \\
\text{subject to:} \quad & R_1 \le \lambda C_{01} \\
& R_1 \le \bar{\lambda} C_{13} \\
& R_2 \le \bar{\lambda} C_{02} \\
& R_2 \le \lambda C_{23} \\
& 0 \le \lambda \le 1,
\end{aligned}$$

where $R_1$ and $R_2$ denote the rate of the upper and the lower branches, respectively. This LP has three unknowns ($R_1$, $R_2$, $\lambda$) and six inequalities. The solution turns three out of six inequalities to equality. The optimum time interval $\lambda^*$ cannot be equal to 0 or 1, as both solutions give a zero rate. Hence, three out of the first four inequalities should become equality, which leads to the following achievable rates for different channel conditions:

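$$\begin{aligned}
R^1_{\text{MDF}} &= \frac{C_{01}(C_{02} + C_{13})}{C_{01} + C_{13}} && \text{if } \Delta < 0,\ C_{02} \le C_{01}, \\
R^2_{\text{MDF}} &= \frac{C_{02}(C_{01} + C_{23})}{C_{02} + C_{23}} && \text{if } \Delta < 0,\ C_{02} > C_{01}, \\
R^3_{\text{MDF}} &= \frac{C_{13}(C_{01} + C_{23})}{C_{01} + C_{13}} && \text{if } \Delta > 0,\ C_{23} \le C_{13}, \\
R^4_{\text{MDF}} &= \frac{C_{23}(C_{02} + C_{13})}{C_{02} + C_{23}} && \text{if } \Delta > 0,\ C_{23} > C_{13}.
\end{aligned} \tag{11}$$

In particular, the achievable rate for the symmetric diamond channel, in which $C_{01} = C_{02}$ and $C_{13} = C_{23}$, is:

$$R_{\text{MDF}} = \min\{C_{01}, C_{13}\}.$$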
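
The closed-form MDF rates can be cross-checked against a brute-force search over the time-sharing parameter. The sketch below is illustrative only; the function names `mdf_rate_at` and `mdf_closed_form` are hypothetical, and the candidate time fractions $C_{13}/(C_{01}+C_{13})$ and $C_{02}/(C_{02}+C_{23})$ are the values that equalize the two hops of branch 1 and branch 2, respectively.

```python
def mdf_rate_at(lam, c01, c02, c13, c23):
    """MDF rate when Forward Mode I gets fraction lam and Mode II gets 1 - lam."""
    return min(lam * c01, (1.0 - lam) * c13) + min((1.0 - lam) * c02, lam * c23)


def mdf_closed_form(c01, c02, c13, c23):
    """Return (lam_star, rate) by picking the branch that should be fully utilized."""
    lam1 = c13 / (c01 + c13)  # equalizes the two hops of branch 1
    lam2 = c02 / (c02 + c23)  # equalizes the two hops of branch 2
    d = c01 * c02 - c13 * c23
    if d < 0:
        lam = lam1 if c02 <= c01 else lam2
    elif d > 0:
        lam = lam1 if c23 <= c13 else lam2
    else:
        lam = lam1  # lam1 == lam2 when Delta = 0
    return lam, mdf_rate_at(lam, c01, c02, c13, c23)


if __name__ == "__main__":
    args = (1.0, 0.5, 2.0, 1.5)  # illustrative capacities with Delta < 0
    lam, rate = mdf_closed_form(*args)
    grid_best = max(mdf_rate_at(k / 10000, *args) for k in range(10001))
    print(lam, rate, grid_best)
```

The grid search is a sanity check rather than part of the scheme: the rate is piecewise linear in the time fraction, so its maximum sits at one of the two kinks.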
The optimum time interval $\lambda^*$ is either equal to $\lambda_1^*$ or $\lambda_2^*$ defined below:

$$\lambda_1^* \triangleq \frac{C_{13}}{C_{01} + C_{13}}, \qquad \text{or} \qquad \lambda_2^* \triangleq \frac{C_{02}}{C_{02} + C_{23}}.$$

Note that if $\lambda^* = \lambda_1^*$, then $\lambda_1^* C_{01} = \bar{\lambda}_1^* C_{13}$. Similarly, $\lambda^* = \lambda_2^*$ leads to $\bar{\lambda}_2^* C_{02} = \lambda_2^* C_{23}$. In other words, $\lambda_i^*$ for $i \in \{1, 2\}$ makes the maximum amount of data that can be received by Relay $i$ equal to the maximum amount of data that can be forwarded by Relay $i$. In this case, branch $i$ (composed of the $0 \to i$ and $i \to 3$ links) is said to be fully utilized.

It is interesting to consider the case in which fully utilizing branch 1 or branch 2 leads to the same data-rate. This case occurs when one of the following happens:

$$\begin{aligned}
& \Delta = 0, \\
& C_{01} = C_{02} \quad \text{if } \Delta < 0, \\
& C_{13} = C_{23} \quad \text{if } \Delta > 0.
\end{aligned} \tag{12}$$

In these situations, one can use either $\lambda_1^*$ or $\lambda_2^*$ fraction of the transmission time for Forward Mode I and the remaining fraction for Forward Mode II and achieve the same data-rate. It will be shown later that the MDF scheme achieves the capacity of the diamond channel if $\Delta = 0$ and is at most 1.21 bits less than the capacity for the other two cases. It is remarked that $\Delta = 0$ makes both branches fully utilized and all four rates in Eq. (11) equal.



B. MDF is Optimal for $\Delta = 0$

Here, it is explained that $R^*_{\text{up}}$, found by solving the dual program (10), is the same as the MDF rate given in Eq. (11) for $\Delta = 0$. It is easy to observe that

$$\tau^* = \left[\, 0,\; \frac{C_{13}}{C_{01} + C_{13}},\; \frac{C_{23}}{C_{02} + C_{23}},\; 0 \,\right] \tag{13}$$

makes all four rate constraints in the dual program (10) equal to the rate obtained in Eq. (11) and satisfies $\sum_{i=1}^{4} \tau_i = 1$. Therefore, the upper bound provided by the vector $\tau^*$ is indeed the capacity of the channel and equals:

$$C_{DC} = \frac{C_{01}C_{13}}{C_{01} + C_{13}} + \frac{C_{02}C_{23}}{C_{02} + C_{23}}. \tag{14}$$

The result is valid for the Gaussian multiple antenna as well as discrete memoryless channels, and therefore $\Delta = 0$
ensures the optimality of the MDF scheme for those channels too.
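
The claim that the dual vector in Eq. (13) makes all four dual constraints meet at the value in Eq. (14) can be verified on a concrete channel. The gains below are hypothetical and were picked so that $C_{01}C_{02} = C_{13}C_{23}$ holds exactly; the function names are likewise assumptions made for this sketch.

```python
import math


def C(p):
    """C(P) = 0.5 * log2(1 + P)."""
    return 0.5 * math.log2(1.0 + p)


def dual_constraints(tau, caps):
    """Right-hand sides of the four rate constraints of the dual program (10)."""
    t1, t2, t3, t4 = tau
    return [
        t1 * caps["C012"] + t2 * caps["C01"] + t3 * caps["C02"],
        t1 * caps["C01"] + t2 * (caps["C01"] + caps["C23"]) + t4 * caps["C23"],
        t1 * caps["C02"] + t3 * (caps["C02"] + caps["C13"]) + t4 * caps["C13"],
        t2 * caps["C23"] + t3 * caps["C13"] + t4 * caps["C123"],
    ]


def tau_star(caps):
    """Dual vector of Eq. (13); it sums to one only when Delta = 0."""
    return (
        0.0,
        caps["C13"] / (caps["C01"] + caps["C13"]),
        caps["C23"] / (caps["C02"] + caps["C23"]),
        0.0,
    )


def capacity_delta_zero(caps):
    """Capacity expression of Eq. (14), valid for Delta = 0."""
    return (caps["C01"] * caps["C13"] / (caps["C01"] + caps["C13"])
            + caps["C02"] * caps["C23"] / (caps["C02"] + caps["C23"]))


if __name__ == "__main__":
    g01, g02, g13, g23 = 3, 63, 7, 15  # gives C01=1, C02=3, C13=1.5, C23=2
    caps = {
        "C01": C(g01), "C02": C(g02), "C13": C(g13), "C23": C(g23),
        "C012": C(g01 + g02),
        "C123": C((math.sqrt(g13) + math.sqrt(g23)) ** 2),
    }
    print(dual_constraints(tau_star(caps), caps), capacity_delta_zero(caps))
```

Since the first and last entries of the vector are zero, the broadcast and multiple-access cut capacities do not influence the bound in this case.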



C. MDF Gap Analysis 

To investigate how close the MDF scheme performs to the capacity of the diamond channel when $\Delta \neq 0$, the appropriate upper bounds are required, which will be derived in Sections IV and V. Therefore, the detailed gap analysis for the MDF scheme is deferred to Appendix C, where it is shown that although a small gap is achievable for some channel conditions, the gap can be large in general. In the following sections, Broadcast and Multiple-Access Modes are added to the MDF algorithm to achieve within 0.71 bits of the capacity for the $\Delta < 0$ and $\Delta > 0$ cases, respectively.

IV. MDF-BC Scheme and Achieving within 0.71 Bits of the Capacity for $\Delta < 0$

In the MDF scheme, since both branches cannot be fully utilized simultaneously when $\Delta < 0$, there exists some unused capacity in the second hop. To efficiently make use of the available resources, Broadcast Mode is added to the MDF scheme. This mode provides the relays with an additional reception time.



A. Achievable Scheme 

The modified protocol uses Broadcast Mode together with Forward Modes I and II. Therefore, by setting $t_4 = 0$ in Eq. (8), the maximum achievable rate of the scheme as a function of the power allocation parameter $\eta$ used in superposition coding is:

$$R_{\text{BC}}(\eta) = \max_{\sum_{i=1}^{3} t_i = 1,\; t_i \ge 0} \Big\{ \min\{t_1 u(\eta) + t_2 C_{01},\; t_3 C_{13}\} + \min\{t_1 v(\eta) + t_3 C_{02},\; t_2 C_{23}\} \Big\}.$$

Recall that $u$ and $v$, defined respectively in Eqs. (4) and (5), are the rates associated with Relays 1 and 2 in Broadcast Mode. First, the optimal schedule is obtained, assuming a fixed $\eta$, and later an appropriate value for $\eta$ will be selected. The achievable rate can be written as the following LP:

$$\begin{aligned}
\text{maximize} \quad & R_{\text{BC}} \\
\text{subject to:} \quad & R_{\text{BC}} \le t_1 (u + v) + t_2 C_{01} + t_3 C_{02} && (15) \\
& R_{\text{BC}} \le t_1 u + t_2 (C_{01} + C_{23}) + t_3 \cdot 0 && (16) \\
& R_{\text{BC}} \le t_1 v + t_2 \cdot 0 + t_3 (C_{02} + C_{13}) && (17) \\
& R_{\text{BC}} \le t_2 C_{23} + t_3 C_{13} && (18) \\
& \textstyle\sum_{i=1}^{3} t_i = 1 && (19) \\
& t_i \ge 0. && (20)
\end{aligned}$$

For a feasible LP, the solution is at one of the extreme points of the constraint set. One of the extreme points can be obtained by solving a set of linear equations containing Eq. (19) and inequalities (15)-(17) considered as equalities. The solution becomes:

$$\begin{aligned}
t_1 &= \frac{-\Delta}{(C_{01} + C_{13})v + (C_{02} + C_{23})u - \Delta}, \\
t_2 &= \frac{C_{13} v + C_{02} u}{(C_{01} + C_{13})v + (C_{02} + C_{23})u - \Delta}, \\
t_3 &= \frac{C_{01} v + C_{23} u}{(C_{01} + C_{13})v + (C_{02} + C_{23})u - \Delta}, \\
R_{\text{BC}}(\eta) &= \frac{C_{13}(C_{01} + C_{23})\,v(\eta) + C_{23}(C_{02} + C_{13})\,u(\eta)}{(C_{01} + C_{13})\,v(\eta) + (C_{02} + C_{23})\,u(\eta) - \Delta}.
\end{aligned} \tag{21}$$

It is easy to verify that $\Delta = 0$ makes $t_1 = 0$, and hence leads to the MDF algorithm. Note that in addition to inequalities (15)-(17), the above extreme point also turns inequality (18) into equality. Now, this extreme point is proven to be the solution to the above LP. If one of the elements of vector $t$ is increased, at least one of the conditions (15)-(17) provides a smaller rate, compared to the rate obtained by the extreme point. For instance, if $t_1$ in Eq. (21) is increased, then, because of Eq. (19), at least one of $t_2$ and $t_3$ should be decreased, which in turn reduces the rate associated with the inequality (18). This confirms that the extreme point is the optimal solution to the LP with constraints (15)-(20).
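
The extreme point in Eq. (21) can be checked numerically: the three time fractions should sum to one and should turn the four rate constraints (15)-(18) into equalities. The sketch below uses illustrative numbers for the capacities and for the broadcast rates `u` and `v`; the function name `mdf_bc_schedule` is hypothetical.

```python
def mdf_bc_schedule(u, v, c01, c02, c13, c23):
    """Schedule (t1, t2, t3) and rate of the extreme point in Eq. (21).

    Intended for Delta <= 0, where t1 = -Delta / denom is non-negative.
    """
    d = c01 * c02 - c13 * c23
    if d > 0:
        raise ValueError("this schedule is intended for Delta <= 0")
    denom = (c01 + c13) * v + (c02 + c23) * u - d
    if denom <= 0:
        raise ValueError("degenerate inputs: denominator is not positive")
    t1 = -d / denom
    t2 = (c13 * v + c02 * u) / denom
    t3 = (c01 * v + c23 * u) / denom
    rate = (c13 * (c01 + c23) * v + c23 * (c02 + c13) * u) / denom
    return (t1, t2, t3), rate


if __name__ == "__main__":
    u, v = 0.4, 0.5                          # illustrative broadcast rates
    c01, c02, c13, c23 = 1.0, 1.0, 2.0, 2.0  # illustrative capacities, Delta = -3
    (t1, t2, t3), rate = mdf_bc_schedule(u, v, c01, c02, c13, c23)
    constraints = [
        t1 * (u + v) + t2 * c01 + t3 * c02,  # (15)
        t1 * u + t2 * (c01 + c23),           # (16)
        t1 * v + t3 * (c02 + c13),           # (17)
        t2 * c23 + t3 * c13,                 # (18)
    ]
    print((t1, t2, t3), rate, constraints)
```

Setting the capacities so that $\Delta = 0$ drives `t1` to zero, which reproduces the two-mode MDF schedule.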

In the following, instead of searching for $\eta^*$, which maximizes $R_{\text{BC}}(\eta)$, an appropriate value for $\eta$ is found that not only provides a small gap from the upper bounds, but also simplifies the gap analysis of Section IV-C. The power allocation parameter $\eta$ is selected to be either $\eta_1 = \frac{1}{g_{01} + 1}$ or $\eta_2 = \frac{1}{g_{02} + 1}$ for the $C_{01} \le C_{02}$ and $C_{01} > C_{02}$ conditions, respectively. As will be shown in Appendix A, the chosen $\eta$ produces the same GDOF as the corresponding upper bound, which is a necessary condition in obtaining a small gap. The corresponding $u$ and $v$ for $\eta_1$ are:

$$\begin{aligned} u(\eta_1) &= C_{01} - C_1, \\ v(\eta_1) &= C_{012} - C_{01}, \end{aligned} \tag{22}$$

and for $\eta_2$ are:

$$\begin{aligned} u(\eta_2) &= C_{012} - C_{02}, \\ v(\eta_2) &= C_{02} - C_2. \end{aligned} \tag{23}$$

In the above,

$$C_1 \triangleq C\!\left(\frac{g_{01}}{g_{01} + 1}\right) < \frac{1}{2}, \tag{24}$$

$$C_2 \triangleq C\!\left(\frac{g_{02}}{g_{02} + 1}\right) < \frac{1}{2}. \tag{25}$$

The selected $\eta$ divides the source power between $u$ and $v$ (considered as the rates of two virtual users in the broadcast channel consisting of the $\mathcal{S} \to \mathcal{R}_1$ and $\mathcal{S} \to \mathcal{R}_2$ links) in such a way that:

1) the sum data-rate (i.e., $u + v$) in the broadcast channel is close to the sum-capacity of the broadcast channel (i.e., $\max\{C_{01}, C_{02}\}$),

2) the weaker user's rate is close to its capacity. For instance, if $C_{01} \le C_{02}$, then $u \approx C_{01}$.

Substituting $u$ and $v$ from Eqs. (22) and (23) into (21) leads to the following achievable rates $R^{1}_{\text{MDF-BC}}$ and $R^{2}_{\text{MDF-BC}}$, corresponding to $\eta_1$ and $\eta_2$:

\[
R^{1}_{\text{MDF-BC}} = \frac{C_{13}(C_{01}+C_{23})C_{012} - C_{01}^{2}C_{13} + C_{01}C_{02}C_{23} - C_1 C_{23}(C_{02}+C_{13})}{(C_{01}+C_{13})(C_{012}-C_{01}+C_{23}) - C_1(C_{02}+C_{23})},
\]
\[
R^{2}_{\text{MDF-BC}} = \frac{C_{23}(C_{02}+C_{13})C_{012} - C_{02}^{2}C_{23} + C_{01}C_{02}C_{13} - C_2 C_{13}(C_{01}+C_{23})}{(C_{02}+C_{23})(C_{012}-C_{02}+C_{13}) - C_2(C_{01}+C_{13})}. \tag{26}
\]
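
The substitution leading to (26) can be checked numerically. The sketch below, for the $\eta_1$ case only, evaluates the rate once through the general expression (21) with $u$ and $v$ from (22), and once through the closed form (26). The helper names `cap` and `mdf_bc_rate_eta1` are hypothetical, and base-2 logarithms are assumed because the paper states gaps in bits.

```python
import math


def cap(x):
    """C(x) = 0.5 * log2(1 + x); base 2 is an assumption (rates in bits)."""
    return 0.5 * math.log2(1.0 + x)


def mdf_bc_rate_eta1(g01, g02, g13, g23):
    """Return (rate via (21)+(22), rate via closed form (26), C1)."""
    c01, c02, c13, c23 = cap(g01), cap(g02), cap(g13), cap(g23)
    c012 = cap(g01 + g02)
    c1 = cap(g01 / (g01 + 1.0))            # Eq. (24)
    u = c01 - c1                           # Eq. (22)
    v = c012 - c01
    delta = c01 * c02 - c13 * c23
    via_21 = (c13 * (c01 + c23) * v + c23 * (c02 + c13) * u) / (
        (c01 + c13) * v + (c02 + c23) * u - delta)
    via_26 = (c13 * (c01 + c23) * c012 - c01 ** 2 * c13
              + c01 * c02 * c23 - c1 * c23 * (c02 + c13)) / (
        (c01 + c13) * (c012 - c01 + c23) - c1 * (c02 + c23))
    return via_21, via_26, c1


if __name__ == "__main__":
    # Illustrative gains with C01 <= C02 and a stronger second hop.
    print(mdf_bc_rate_eta1(10.0, 20.0, 100.0, 200.0))
```

The two returned rates should agree up to floating-point error; the $\eta_2$ case follows by swapping the roles of the two relays.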

B. Upper Bound 

Following the discussion in Section II-B, we select one of the extreme points of the constraint set (10) to obtain a fairly tight upper bound. Below, some insights on how to find an appropriate extreme point are given.

First, Forward Modes I and II play an important role in data transfer from Source to Destination. These two modes let both Source and Destination be simultaneously active, which is important for efficient communication. This implies that, generally, $t_2$ and $t_3$ are not zero in the original LP. In addition, $\Delta < 0$ roughly means that the second hop is better than the first hop. In this case, Broadcast Mode helps the relays to collect more data, which will be sent to Destination later using Forward Modes I and II. Therefore, Multiple-Access Mode is less important when $\Delta < 0$, and consequently $t_4$ can be set to zero. Using the complementary slackness theorem of linear programming (cf. [27]), having non-zero $t_1$, $t_2$, and $t_3$ in the original LP translates into having the first three inequalities in the dual program satisfied with equality. Now, looking at the dual problem (10), which has the same structure as the original LP, in order to achieve a smaller objective function, we set $\tau_2$ or $\tau_3$ to zero. This is in contrast to the claim of having both $t_2$ and $t_3$ non-zero in the original LP with the maximization objective. Therefore, the vector $\tau$ with the following properties is selected:

1) Either $\tau_2$ or $\tau_3$ is zero.

2) The first three inequalities are satisfied with equality.

To have a valid $\tau$, we need to make sure that all the elements of the vector $\tau$ are non-negative and that $\tau$ satisfies the last condition.

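As mentioned earlier, either $\tau_2$ or $\tau_3$ can be set to zero in the dual program (10). For instance, setting $\tau_2 = 0$ in the dual program gives the following LP:

\[
\begin{aligned}
\text{minimize} \quad & R \\
\text{subject to:} \quad & R \ge \tau_1 C_{012} + \tau_3 C_{02} + \tau_4 \cdot 0 \\
& R \ge \tau_1 C_{01} + \tau_3 \cdot 0 + \tau_4 C_{23} \\
& R \ge \tau_1 C_{02} + \tau_3 (C_{02}+C_{13}) + \tau_4 C_{13} \\
& R \ge \tau_1 \cdot 0 + \tau_3 C_{13} + \tau_4 C_{123} \\
& \textstyle\sum_{i=1,\, i \ne 2}^{4} \tau_i = 1, \quad \tau_i \ge 0.
\end{aligned} \tag{27}
\]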

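Setting the first three inequalities to equalities gives:

\[
\begin{aligned}
\tau_1^* &= \frac{C_{13}}{C_{012}-C_{02}+C_{13}}, \\
\tau_3^* &= \frac{C_{23}(C_{012}-C_{02}) - C_{13}(C_{012}-C_{01})}{(C_{02}+C_{23})(C_{012}-C_{02}+C_{13})}, \\
\tau_4^* &= \frac{C_{13}(C_{012}-C_{01}) + C_{02}(C_{012}-C_{02})}{(C_{02}+C_{23})(C_{012}-C_{02}+C_{13})}, \\
R^* &= \frac{(C_{02}+C_{13})C_{23}}{C_{02}+C_{23}} + \frac{C_{13}(C_{01}C_{02} - C_{13}C_{23})}{(C_{02}+C_{23})(C_{012}-C_{02}+C_{13})}.
\end{aligned} \tag{28}
\]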
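
A quick way to confirm the dual point (28) is to plug it back into the LP (27). The following Python sketch does this for arbitrary positive capacities; the function name `dual_point_tau2_zero` and the sample values are illustrative assumptions, and the capacities are passed in directly rather than derived from channel gains.

```python
def dual_point_tau2_zero(c01, c02, c13, c23, c012):
    """Dual point of Eq. (28): (tau1, tau3, tau4, R) with tau2 = 0."""
    d = (c02 + c23) * (c012 - c02 + c13)
    tau1 = c13 / (c012 - c02 + c13)
    tau3 = (c23 * (c012 - c02) - c13 * (c012 - c01)) / d
    tau4 = (c13 * (c012 - c01) + c02 * (c012 - c02)) / d
    r = (c02 + c13) * c23 / (c02 + c23) + c13 * (c01 * c02 - c13 * c23) / d
    return tau1, tau3, tau4, r


if __name__ == "__main__":
    c01, c02, c13, c23, c012 = 2.0, 1.0, 3.0, 4.0, 2.5   # sample values
    tau1, tau3, tau4, r = dual_point_tau2_zero(c01, c02, c13, c23, c012)
    print("sum of tau:", tau1 + tau3 + tau4)
    # Right-hand sides of the first three constraints of (27).
    print("R:", r)
    print("c1:", tau1 * c012 + tau3 * c02)
    print("c2:", tau1 * c01 + tau4 * c23)
    print("c3:", tau1 * c02 + tau3 * (c02 + c13) + tau4 * c13)
```

With these sample values, $\tau_3^*$ is positive, which corresponds to the $\Gamma \ge 0$ condition discussed next.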

For obtaining a valid result, the following conditions have to be ensured:

1) $\tau_3^* \ge 0$.

Since $C_{02} \le C_{012}$, the denominator of $\tau_3^*$ is non-negative; therefore, the non-negativity of the numerator has to be guaranteed. This imposes the constraint $\Gamma \ge 0$ on the values of the channel parameters, where $\Gamma$ is defined as:

\[
\Gamma \triangleq C_{23}[C_{012} - C_{02}] - C_{13}[C_{012} - C_{01}]. \tag{29}
\]

2) $R^* \ge \tau_3^* C_{13} + \tau_4^* C_{123}$.

To satisfy the following condition:

\[
R^* = \tau_1^* C_{02} + \tau_3^* (C_{02}+C_{13}) + \tau_4^* C_{13} \ge \tau_3^* C_{13} + \tau_4^* C_{123},
\]

it is sufficient to show:

\[
(\tau_1^* + \tau_3^*) C_{02} \ge \tau_4^* (C_{123} - C_{13}),
\]

which can be equivalently represented as:

\[
\frac{C_{02}}{C_{123} - C_{13} + C_{02}} \ge \tau_4^*.
\]

The following lemma proves the preceding inequality.

Lemma 1: $\tau_4^* \le \frac{C_{02}}{C_{123} - C_{13} + C_{02}}$ if $C_{123} \le C_{13} + C_{23}$.

Proof: See Appendix D-A. ■
Lemma 1 requires $C_{123} \le C_{13} + C_{23}$, which is not true for $g_{13} g_{23} < 4$. To be able to use Lemma 1 for the case of $g_{13} g_{23} < 4$, we replace either $C_{13}$ by $\tilde{C}_{13} = C_{13} + \delta$ or $C_{23}$ by $\tilde{C}_{23} = C_{23} + \delta$, with $\delta$ defined as:

\[
\delta \triangleq \max\{C_{123} - (C_{13} + C_{23}),\, 0\}. \tag{30}
\]

This change provides the desired inequality (i.e., $C_{123} \le \tilde{C}_{13} + C_{23}$ or $C_{123} \le C_{13} + \tilde{C}_{23}$) at the expense of increasing the upper bound. However, we will show in Lemma 2 that this increase is always less than $\delta$. We will prove that $\delta$ itself is bounded in Lemma 3.

Continuing the derivation of the upper bound from the LP (27), if $C_{123} > C_{13} + C_{23}$, then $C_{23}$ is replaced by $\tilde{C}_{23}$. In this case, the dual program (27) remains unchanged except for $C_{23}$. Hence, the set of solutions (28) can be used by replacing $C_{23}$ with $\tilde{C}_{23}$, and thus the upper bound becomes:

\[
\tilde{R}^* = \frac{(C_{23}+\delta)(C_{02}+C_{13})}{C_{02}+C_{23}+\delta} + \frac{C_{13}\big(C_{01}C_{02} - C_{13}(C_{23}+\delta)\big)}{(C_{02}+C_{23}+\delta)(C_{012}-C_{02}+C_{13})}. \tag{31}
\]

Note that the inequality $\tilde{\tau}_3^* \ge 0$ holds because $\tilde{\Gamma} \ge 0$ simply follows from $\Gamma \ge 0$.¹ According to Lemma 1, since $C_{123} = C_{13} + \tilde{C}_{23}$, the condition $\tilde{R}^* \ge \tilde{\tau}_3^* C_{13} + \tilde{\tau}_4^* C_{123}$ is satisfied. Lemma 2 shows that the enlarged upper bound $\tilde{R}^*$ (Eq. (31)) is at most $\delta$ bits greater than the upper bound of (28).

Lemma 2: If $C_{123} > C_{13} + C_{23}$, then $\tilde{R}^* - R^* \le \delta$.

Proof: See Appendix D-B. ■
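Therefore, the proposed upper bound for $\Delta < 0$ and $\Gamma \ge 0$ is:

\[
R_{\text{up}}^{2} = \frac{C_{23}(C_{02}+C_{13})}{C_{02}+C_{23}} + \frac{C_{13}\Delta}{(C_{012}-C_{02}+C_{13})(C_{02}+C_{23})} + \delta.
\]

¹The superscript $\sim$ is used to indicate parameters associated with $\tilde{C}_{23}$. For instance, $\tilde{\Gamma}$ has the same formula as $\Gamma$ in Eq. (29), with $C_{23}$ replaced by $\tilde{C}_{23}$.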
Similarly, when $\Delta < 0$ and $\Gamma \le 0$, $\tau_3$ is set to zero and again the first three inequalities are assumed to be satisfied with equality in the dual program (10). Following the same procedure, the subsequent results are achieved:

\[
\begin{aligned}
\tau_1^* &= \frac{C_{23}}{C_{012}-C_{01}+C_{23}}, \\
\tau_2^* &= \frac{C_{13}(C_{012}-C_{01}) - C_{23}(C_{012}-C_{02})}{(C_{01}+C_{13})(C_{012}-C_{01}+C_{23})}, \\
\tau_4^* &= \frac{C_{23}(C_{012}-C_{02}) + C_{01}(C_{012}-C_{01})}{(C_{01}+C_{13})(C_{012}-C_{01}+C_{23})}, \\
R^* &= \frac{(C_{01}+C_{23})C_{13}}{C_{01}+C_{13}} + \frac{C_{23}(C_{01}C_{02} - C_{13}C_{23})}{(C_{01}+C_{13})(C_{012}-C_{01}+C_{23})}, \\
R_{\text{up}}^{1} &= \frac{C_{13}(C_{01}+C_{23})}{C_{01}+C_{13}} + \frac{C_{23}\Delta}{(C_{012}-C_{01}+C_{23})(C_{01}+C_{13})} + \delta.
\end{aligned} \tag{32}
\]

In this case, when $C_{123} > C_{13} + C_{23}$, $C_{13}$ is replaced by $C_{13} + \delta$. It is easy to see that the preceding results can be obtained by exchanging the roles of $C_{01} \leftrightarrow C_{02}$, $C_{13} \leftrightarrow C_{23}$, and $\tau_2 \leftrightarrow \tau_3$ in the results derived for the case of $\Delta < 0$ and $\Gamma \ge 0$.

In order to be able to achieve a small gap from the upper bounds, $\delta$ should be bounded. Lemma 3 proves that $\delta$ is smaller than 0.21 bits.

Lemma 3: $\delta \le \frac{1}{2}\log\left(\frac{4}{3}\right)$.

Proof: See Appendix D-C. ■
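
The bound of Lemma 3 is easy to probe numerically. The sketch below computes $\delta$ from Eq. (30) for given second-hop gains and compares it with $\frac{1}{2}\log_2\frac{4}{3} \approx 0.2075$ bits. The names `cap`, `delta_gap` and `LEMMA3_BOUND` are hypothetical, and base-2 logarithms are assumed since the gaps are quoted in bits.

```python
import math


def cap(x):
    """C(x) = 0.5 * log2(1 + x)."""
    return 0.5 * math.log2(1.0 + x)


def delta_gap(g13, g23):
    """delta = max{C123 - (C13 + C23), 0}, Eq. (30)."""
    c123 = cap((math.sqrt(g13) + math.sqrt(g23)) ** 2)
    return max(c123 - (cap(g13) + cap(g23)), 0.0)


LEMMA3_BOUND = 0.5 * math.log2(4.0 / 3.0)   # about 0.2075 bits

if __name__ == "__main__":
    print("bound:", LEMMA3_BOUND)
    print("delta at g13 = g23 = 0.5:", delta_gap(0.5, 0.5))
    print("delta at g13 = g23 = 4:", delta_gap(4.0, 4.0))
```

The symmetric point $g_{13} = g_{23} = 1/2$ (so that $g_{13} g_{23} = 1/4$) meets the bound, while any pair with $g_{13} g_{23} \ge 4$ gives $\delta = 0$.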

C. Gap Analysis
The MDF-BC scheme is proposed for the following regions:

1) $\Delta < 0$, $\Gamma \le 0$, $C_{02} \ge C_{01}$, and $C_{01} \ge 1$

2) $\Delta < 0$, $\Gamma \ge 0$, $C_{01} \ge C_{02}$, and $C_{02} \ge 1$

For $\Delta < 0$, Appendix C shows that the MDF scheme provides a small gap from the upper bounds for the remaining regions. Here, the first case is considered. The gap $\kappa^{1}_{\text{MDF-BC}}$ between the achievable rate $R^{1}_{\text{MDF-BC}}$ and the upper bound $R^{1}_{\text{up}}$ is:

\[
\kappa^{1}_{\text{MDF-BC}} = \frac{-C_1 \big( (C_{012}-C_{01}+C_{23})(C_{13}-C_{23}) + C_{23}(C_{02}+C_{23}) \big) \Delta}{(C_{01}+C_{13})(C_{012}-C_{01}+C_{23}) \big( (C_{01}+C_{13})(C_{012}-C_{01}+C_{23}) - C_1(C_{02}+C_{23}) \big)} + \delta.
\]

In the following lemma, the gap $\kappa^{1}_{\text{MDF-BC}}$ is proved to be smaller than $\frac{1}{2} + \delta$ bits.

Lemma 4: $\kappa^{1}_{\text{MDF-BC}} \le \frac{1}{2} + \delta$.

Proof: See Appendix D-D. ■
By exchanging the roles of $g_{01} \leftrightarrow g_{02}$ and $g_{13} \leftrightarrow g_{23}$, the gap for the second case can be easily derived and shown to be less than $\frac{1}{2} + \delta$ bits.

V. MDF-MAC Scheme and Achieving within 0.71 Bits of the Capacity for $\Delta > 0$

Similar to Section IV, a third mode is added to the MDF scheme when $\Delta > 0$ to effectively utilize the unused capacity of the first hop.
A. Achievable Scheme

Here, Multiple-Access Mode is added to the MDF scheme with independent messages sent from the relays to Destination. This mode provides the relays with an increased transmission time. The modified protocol uses three transmission modes, i.e., Multiple-Access Mode and Forward Modes I and II. Therefore, by setting $t_1 = 0$ in Eq.
dD the maximum achievable rate of the scheme, i?MAC is: 



\[
R_{\text{MAC}} = \max \Big\{ \min\{t_2 C_{01},\; t_3 C_{13} + R_1\} + \min\{t_3 C_{02},\; t_2 C_{23} + R_2\} \Big\}, \tag{33}
\]

where $R_1$ and $R_2$ are the rates that Relays 1 and 2 provide to Destination in Multiple-Access Mode, respectively. These rates satisfy the multiple-access constraints in (6). Lemma 5 presents achievable rates, which will be shown to be smaller than the capacity by at most 0.71 bits in Section V-C.



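Lemma 5: The achievable rates for $\Delta > 0$, together with their corresponding scheduling, are as follows:

\[
\begin{aligned}
R^{1}_{\text{MDF-MAC}} &= \frac{C_{01}(C_{02}+C_{13})}{C_{01}+C_{13}} - \frac{C_{02}\Delta}{(C_{01}+C_{13})(C_{\text{MAC}}-C_{13}+C_{02})} && \text{for } \Delta > 0,\ \Gamma' \le 0, \\
R^{2}_{\text{MDF-MAC}} &= \frac{C_{02}(C_{01}+C_{23})}{C_{02}+C_{23}} - \frac{C_{01}\Delta}{(C_{02}+C_{23})(C_{\text{MAC}}-C_{23}+C_{01})} && \text{for } \Delta > 0,\ \Gamma' \ge 0,
\end{aligned} \tag{34}
\]

with the time intervals

\[
\Gamma' \le 0: \quad t_2 = \frac{C_{13}}{C_{01}+C_{13}}, \quad
t_3 = \frac{C_{01}(C_{\text{MAC}}-C_{13}) + C_{13}C_{23}}{(C_{01}+C_{13})(C_{\text{MAC}}-C_{13}+C_{02})}, \quad
t_4 = \frac{\Delta}{(C_{01}+C_{13})(C_{\text{MAC}}-C_{13}+C_{02})},
\]
\[
\Gamma' \ge 0: \quad t_2 = \frac{C_{02}(C_{\text{MAC}}-C_{23}) + C_{13}C_{23}}{(C_{02}+C_{23})(C_{\text{MAC}}-C_{23}+C_{01})}, \quad
t_3 = \frac{C_{23}}{C_{02}+C_{23}}, \quad
t_4 = \frac{\Delta}{(C_{02}+C_{23})(C_{\text{MAC}}-C_{23}+C_{01})},
\]

where

\[
\Gamma' \triangleq C_{02}[C_{123} - C_{23}] - C_{01}[C_{123} - C_{13}]. \tag{35}
\]

Proof: See Appendix D-E. ■
It is noted that if $\Delta = 0$, $t_4$ becomes zero and the scheme is converted to the MDF scheme.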

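B. Upper Bound

Following the same procedure as Section IV-B, the upper bound for the case of $\Delta > 0$, $\Gamma' \ge 0$ is attained from (28) by exchanging the roles of $C_{01} \leftrightarrow C_{13}$, $C_{02} \leftrightarrow C_{23}$, $\tau_2 \leftrightarrow \tau_3$, and $\tau_1 \leftrightarrow \tau_4$. Similarly, when $\Delta > 0$ and $\Gamma' \le 0$, swapping the positions of $C_{01} \leftrightarrow C_{23}$, $C_{02} \leftrightarrow C_{13}$, and $\tau_1 \leftrightarrow \tau_4$ in (28) provides the upper bound. Therefore:

\[
\begin{aligned}
R_{\text{up}}^{3} &= \frac{C_{01}(C_{02}+C_{13})}{C_{01}+C_{13}} + \frac{-C_{02}\Delta}{(C_{123}-C_{13}+C_{02})(C_{01}+C_{13})} + \delta && \text{for } \Gamma' \le 0, \\
R_{\text{up}}^{4} &= \frac{C_{02}(C_{01}+C_{23})}{C_{02}+C_{23}} + \frac{-C_{01}\Delta}{(C_{123}-C_{23}+C_{01})(C_{02}+C_{23})} + \delta && \text{for } \Gamma' \ge 0.
\end{aligned} \tag{36}
\]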

C. Gap Analysis

By comparing the achievable rates (34) and the upper bounds (36), the gaps $\kappa^{1}_{\text{MAC}}$ and $\kappa^{2}_{\text{MAC}}$ are respectively calculated for the $\Gamma' \le 0$ and $\Gamma' \ge 0$ cases as:

\[
\begin{aligned}
\kappa^{1}_{\text{MAC}} &\triangleq R_{\text{up}}^{3} - R^{1}_{\text{MDF-MAC}} = \frac{C_{02}(C_{123}-C_{\text{MAC}})\Delta}{(C_{01}+C_{13})(C_{\text{MAC}}-C_{13}+C_{02})(C_{123}-C_{13}+C_{02})} + \delta, \\
\kappa^{2}_{\text{MAC}} &\triangleq R_{\text{up}}^{4} - R^{2}_{\text{MDF-MAC}} = \frac{C_{01}(C_{123}-C_{\text{MAC}})\Delta}{(C_{02}+C_{23})(C_{\text{MAC}}-C_{23}+C_{01})(C_{123}-C_{23}+C_{01})} + \delta.
\end{aligned}
\]

To show that the above gaps are small, Lemma 6 is employed.

Lemma 6: $C_{123} - C_{\text{MAC}} \le \frac{1}{2}$.

Proof: See Appendix D-F. ■
Considering Lemma 6, it is straightforward to show that the gap is at most $\frac{1}{2} + \delta$ bits. Therefore, adding Multiple-Access Mode, with independent messages sent from the relays to Destination, to the MDF scheme ensures a gap of less than 0.71 bits from the upper bounds for $\Delta > 0$.

VI. Conclusion 

In this work, we considered a dual-hop network with two parallel relays in which each transmitting node has a constant power constraint. We categorized the network into three classes based on the fundamental parameter of the network, $\Delta$, defined in this paper. We derived explicit upper bounds for the different classes using the cut-set bound. Based on the upper bounds, we proved that the MDF scheme, which employs two transmission modes (Forward Modes I and II), achieves the capacity of the channel when $\Delta = 0$. Furthermore, we analyzed the gap between the achievable rate of the MDF scheme and the upper bounds, showing that the gap can be large in some ranges of parameters when $\Delta \neq 0$. To guarantee a gap of at most 0.71 bits from the bounds, we added an extra broadcast or multiple-access mode to the baseline MDF scheme for the cases of $\Delta < 0$ and $\Delta > 0$, respectively. In addition, we provided the asymptotic capacity analysis in the high SNR regime. Finally, we argued that when the transmitting nodes operate under average power constraints, the gap between the achievable scheme and the cut-set upper bound is at most 3.6 bits.
Appendix A

Generalized Degrees of Freedom Characterization 

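It is interesting to consider the asymptotic capacity of the diamond channel in the high SNR regime. A useful parameter in studying this capacity is the GDOF (cf. [16], [26]) defined as:

\[
\text{GDOF}(\alpha) = \lim_{P \to \infty} \frac{R}{\log P},
\]

where $R$ is the data-rate, $P$ is a channel parameter (can be considered as SNR), and $\alpha = \{\alpha_{01}, \alpha_{02}, \alpha_{13}, \alpha_{23}\}$ with

\[
\alpha_{ij} = \lim_{P \to \infty} \frac{\log g_{ij}}{\log P} \quad \text{for } i \in \{0, 1, 2\} \text{ and } j \in \{1, 2, 3\}.
\]

The vector $\alpha$ shows how channel gains scale with $P$. Based on the above definition, the following approximations are valid:

\[
\begin{aligned}
C_{ij} &= \tfrac{1}{2}\log(1 + g_{ij}) \approx \tfrac{1}{2}\alpha_{ij}\log P, \\
C_{012} &= \tfrac{1}{2}\log(1 + g_{01} + g_{02}) \approx \tfrac{1}{2}\max\{\alpha_{01}, \alpha_{02}\}\log P, \\
C_{123} &= \tfrac{1}{2}\log\big(1 + (\sqrt{g_{13}} + \sqrt{g_{23}})^2\big) \approx \tfrac{1}{2}\max\{\alpha_{13}, \alpha_{23}\}\log P, \\
C_{\text{MAC}} &= \tfrac{1}{2}\log(1 + g_{13} + g_{23}) \approx \tfrac{1}{2}\max\{\alpha_{13}, \alpha_{23}\}\log P, \\
\Gamma &\approx \tfrac{1}{4}\big\{\alpha_{23}(\max\{\alpha_{01}, \alpha_{02}\} - \alpha_{02}) - \alpha_{13}(\max\{\alpha_{01}, \alpha_{02}\} - \alpha_{01})\big\}(\log P)^2 + \sigma\log(P), \\
\Gamma' &\approx \tfrac{1}{4}\big\{\alpha_{02}(\max\{\alpha_{13}, \alpha_{23}\} - \alpha_{23}) - \alpha_{01}(\max\{\alpha_{13}, \alpha_{23}\} - \alpha_{13})\big\}(\log P)^2 + \sigma'\log(P),
\end{aligned}
\]

where $\sigma$ and $\sigma'$ are positive constants. In the following analysis, it is assumed that the $(\log P)^2$ terms are dominant, i.e., the coefficients of $(\log P)^2$ for $\Gamma$ and $\Gamma'$ are not zero. If this assumption is not valid, the MDF scheme achieves the optimum GDOF of the channel. According to the above approximations, it is easy to infer:

\[
\begin{aligned}
\Gamma \le 0 &\quad \text{if } \alpha_{01} \le \alpha_{02}; \\
\Gamma \ge 0 &\quad \text{if } \alpha_{01} \ge \alpha_{02}; \\
\Gamma' \le 0 &\quad \text{if } \alpha_{13} \le \alpha_{23}; \\
\Gamma' \ge 0 &\quad \text{if } \alpha_{13} \ge \alpha_{23}.
\end{aligned}
\]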
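Therefore, the GDOF associated with the upper bounds is:

\[
\begin{aligned}
\text{GDOF}_{\text{up}}^{1} &= \frac{\alpha_{13}(\alpha_{01}+\alpha_{23})}{\alpha_{01}+\alpha_{13}} + \frac{\alpha_{23}(\alpha_{01}\alpha_{02} - \alpha_{13}\alpha_{23})}{(\alpha_{01}+\alpha_{13})(\alpha_{02}-\alpha_{01}+\alpha_{23})}, \\
\text{GDOF}_{\text{up}}^{2} &= \frac{\alpha_{23}(\alpha_{02}+\alpha_{13})}{\alpha_{02}+\alpha_{23}} + \frac{\alpha_{13}(\alpha_{01}\alpha_{02} - \alpha_{13}\alpha_{23})}{(\alpha_{02}+\alpha_{23})(\alpha_{01}-\alpha_{02}+\alpha_{13})}, \\
\text{GDOF}_{\text{up}}^{3} &= \frac{\alpha_{01}(\alpha_{02}+\alpha_{13})}{\alpha_{01}+\alpha_{13}} + \frac{-\alpha_{02}(\alpha_{01}\alpha_{02} - \alpha_{13}\alpha_{23})}{(\alpha_{01}+\alpha_{13})(\alpha_{23}-\alpha_{13}+\alpha_{02})}, \\
\text{GDOF}_{\text{up}}^{4} &= \frac{\alpha_{02}(\alpha_{01}+\alpha_{23})}{\alpha_{02}+\alpha_{23}} + \frac{-\alpha_{01}(\alpha_{01}\alpha_{02} - \alpha_{13}\alpha_{23})}{(\alpha_{02}+\alpha_{23})(\alpha_{13}-\alpha_{23}+\alpha_{01})}.
\end{aligned} \tag{37}
\]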

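The GDOF for different achievability schemes is as follows:
MDF:

\[
\begin{aligned}
\text{GDOF}^{1}_{\text{MDF}} &= \frac{\alpha_{01}(\alpha_{02}+\alpha_{13})}{\alpha_{01}+\alpha_{13}}, &
\text{GDOF}^{2}_{\text{MDF}} &= \frac{\alpha_{02}(\alpha_{01}+\alpha_{23})}{\alpha_{02}+\alpha_{23}}, \\
\text{GDOF}^{3}_{\text{MDF}} &= \frac{\alpha_{13}(\alpha_{01}+\alpha_{23})}{\alpha_{01}+\alpha_{13}}, &
\text{GDOF}^{4}_{\text{MDF}} &= \frac{\alpha_{23}(\alpha_{02}+\alpha_{13})}{\alpha_{02}+\alpha_{23}}.
\end{aligned} \tag{38}
\]

MDF-BC:

\[
\begin{aligned}
\text{GDOF}^{1}_{\text{MDF-BC}} &= \frac{\alpha_{02}\alpha_{13}(\alpha_{01}+\alpha_{23}) - \alpha_{01}^{2}\alpha_{13} + \alpha_{01}\alpha_{02}\alpha_{23}}{(\alpha_{01}+\alpha_{13})(\alpha_{02}-\alpha_{01}+\alpha_{23})}, \\
\text{GDOF}^{2}_{\text{MDF-BC}} &= \frac{\alpha_{01}\alpha_{23}(\alpha_{02}+\alpha_{13}) - \alpha_{02}^{2}\alpha_{23} + \alpha_{01}\alpha_{02}\alpha_{13}}{(\alpha_{02}+\alpha_{23})(\alpha_{01}-\alpha_{02}+\alpha_{13})}.
\end{aligned} \tag{39}
\]

MDF-MAC:

\[
\begin{aligned}
\text{GDOF}^{1}_{\text{MDF-MAC}} &= \frac{\alpha_{01}(\alpha_{02}+\alpha_{13})}{\alpha_{01}+\alpha_{13}} - \frac{\alpha_{02}(\alpha_{01}\alpha_{02} - \alpha_{13}\alpha_{23})}{(\alpha_{01}+\alpha_{13})(\alpha_{23}-\alpha_{13}+\alpha_{02})}, \\
\text{GDOF}^{2}_{\text{MDF-MAC}} &= \frac{\alpha_{02}(\alpha_{01}+\alpha_{23})}{\alpha_{02}+\alpha_{23}} - \frac{\alpha_{01}(\alpha_{01}\alpha_{02} - \alpha_{13}\alpha_{23})}{(\alpha_{02}+\alpha_{23})(\alpha_{13}-\alpha_{23}+\alpha_{01})}.
\end{aligned} \tag{40}
\]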

By comparing the upper bounds on the GDOF and the achievable GDOFs, it is easy to see that MDF-BC and 
MDF-MAC achieve the optimum GDOF of the channel, while the MDF cannot achieve it for all channel parameters. 
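
The comparison can be illustrated for one regime. The Python sketch below evaluates the first upper bound of (37), the first MDF-BC expression of (39), and the first two MDF expressions of (38) for a sample scaling vector with $\alpha_{01}\alpha_{02} < \alpha_{13}\alpha_{23}$ and $\alpha_{01} < \alpha_{02}$. The function names and the sample vector are illustrative assumptions, and pairing the first two MDF expressions with the $\Delta < 0$ regime is inferred from the gap expressions in Appendix C.

```python
def gdof_up_1(a01, a02, a13, a23):
    """First upper-bound GDOF of (37)."""
    return (a13 * (a01 + a23) / (a01 + a13)
            + a23 * (a01 * a02 - a13 * a23)
            / ((a01 + a13) * (a02 - a01 + a23)))


def gdof_mdf_bc_1(a01, a02, a13, a23):
    """First MDF-BC GDOF of (39)."""
    num = a02 * a13 * (a01 + a23) - a01 ** 2 * a13 + a01 * a02 * a23
    return num / ((a01 + a13) * (a02 - a01 + a23))


def gdof_mdf_first_two(a01, a02, a13, a23):
    """First two MDF GDOF expressions of (38)."""
    return (a01 * (a02 + a13) / (a01 + a13),
            a02 * (a01 + a23) / (a02 + a23))


if __name__ == "__main__":
    alpha = (1.0, 2.0, 3.0, 4.0)   # sample scaling exponents
    print("upper bound:", gdof_up_1(*alpha))
    print("MDF-BC     :", gdof_mdf_bc_1(*alpha))
    print("MDF        :", gdof_mdf_first_two(*alpha))
```

For this sample vector the MDF-BC value coincides with the upper bound, while both MDF values stay strictly below it.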



Appendix B 

Diamond Channel with Average Power Constraints 

In this appendix, it is shown that if the transmitting nodes are subject to average power constraints, each of the cut-set bounds in Eq. (9) is increased by at most $\frac{2}{\ln 2}$ bits. This analysis confirms that the achievable schemes proposed in this paper with constant power constraints are still valid. In other words, they provide a gap of at most $0.71 + \frac{2}{\ln 2} < 3.6$ bits from the cut-set bounds.

Let $P_{S}^{(i)*}$, $P_{R_1}^{(i)*}$, and $P_{R_2}^{(i)*}$, for $i \in \{1, \dots, 4\}$, be the optimum power allocated to Source, Relay 1, and Relay 2 in transmission mode $i$ with the corresponding time interval $t_i^*$, leading to the cut-set bound $R_0$. The following constraints are in effect:²

\[
\sum_{i=1}^{4} t_i^* P_{S}^{(i)*} \le P_{S}, \qquad
\sum_{i=1}^{4} t_i^* P_{R_1}^{(i)*} \le P_{R_1}, \qquad
\sum_{i=1}^{4} t_i^* P_{R_2}^{(i)*} \le P_{R_2}. \tag{41}
\]

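Therefore, the cut-set upper bound $R_0$ satisfies the following constraints:

\[
\begin{aligned}
R_0 &\le t_1^* C\big((g_{01}+g_{02})P_{S}^{(1)*}\big) + t_2^* C\big(g_{01}P_{S}^{(2)*}\big) + t_3^* C\big(g_{02}P_{S}^{(3)*}\big), \\
R_0 &\le t_1^* C\big(g_{01}P_{S}^{(1)*}\big) + t_2^* \Big(C\big(g_{01}P_{S}^{(2)*}\big) + C\big(g_{23}P_{R_2}^{(2)*}\big)\Big) + t_4^* C\big(g_{23}P_{R_2}^{(4)*}\big), \\
R_0 &\le t_1^* C\big(g_{02}P_{S}^{(1)*}\big) + t_3^* \Big(C\big(g_{02}P_{S}^{(3)*}\big) + C\big(g_{13}P_{R_1}^{(3)*}\big)\Big) + t_4^* C\big(g_{13}P_{R_1}^{(4)*}\big), \\
R_0 &\le t_2^* C\big(g_{23}P_{R_2}^{(2)*}\big) + t_3^* C\big(g_{13}P_{R_1}^{(3)*}\big) + t_4^* C\Big(\big(\sqrt{g_{13}P_{R_1}^{(4)*}} + \sqrt{g_{23}P_{R_2}^{(4)*}}\big)^2\Big).
\end{aligned} \tag{42}
\]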

Suppose that the vector $t'$ is the solution to the LP (9) leading to the rate $R_1$. If the vector $t^*$ is used instead of $t'$ in the LP, the resulting rate that satisfies the conditions of the LP, called $R_2$, becomes smaller than $R_1$. It is
clear that the increase in the cut-set bound due to the average instead of the constant power constraints (compare 
Eq. (dJ to Eq. (l4Tli). i.e., Rq — Ri is smaller than i?o — R2- Here, it is proved that Rq — R2 < j^- 

Consider each component term in the form of $t^* C(\cdot)$ present in the inequality set (42). For instance, consider $R_{c,0} \triangleq t_1^* C\big(g_{02}P_{S}^{(1)*}\big)$. The corresponding term in constructing $R_2$ is $R_{c,2} \triangleq t_1^* C(g_{02}P_{S})$. Because of the power constraints (41), $R_{c,0} \le t_1^* C\big(\frac{g_{02}P_{S}}{t_1^*}\big)$. Therefore, it is easy to show:

\[
R_{c,0} - R_{c,2} \le t_1^* C\!\left(\frac{g_{02}P_{S}(1 - t_1^*)}{(1 + g_{02}P_{S})\,t_1^*}\right)
\overset{(a)}{\le} \frac{g_{02}P_{S}(1 - t_1^*)}{2(1 + g_{02}P_{S})\ln 2}
\le \frac{1}{2\ln 2},
\]

where (a) is due to the fact that $C(x) \le \frac{x}{2\ln 2}$ for any $x \ge 0$. Similar analysis applies to each component term. It is observed that the first and fourth cut-set bounds in inequality set (42) have three component terms and the second and third cut-set bounds have four component terms. Therefore, $R_0 - R_2 \le \frac{2}{\ln 2}$.
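
The per-term bound and the resulting 3.6-bit figure can be checked with a few lines of Python. The sketch below evaluates the worst case of $t\,[C(\text{snr}/t) - C(\text{snr})]$, which upper-bounds $R_{c,0} - R_{c,2}$ when all of the average power is concentrated in one mode, and then adds four such terms to the 0.71-bit gap. The names `cap`, `term_gap` and `PER_TERM_BOUND` are hypothetical, and `snr` stands for the product $g_{02}P_S$.

```python
import math


def cap(x):
    """C(x) = 0.5 * log2(1 + x)."""
    return 0.5 * math.log2(1.0 + x)


def term_gap(t, snr):
    """Upper bound on R_c0 - R_c2 for one component term: t*[C(snr/t) - C(snr)]."""
    return t * (cap(snr / t) - cap(snr))


PER_TERM_BOUND = 1.0 / (2.0 * math.log(2.0))   # 1 / (2 ln 2), about 0.7213

if __name__ == "__main__":
    ts = [k / 200.0 for k in range(1, 201)]
    snrs = [10.0 ** e for e in range(-2, 7)]
    worst = max(term_gap(t, s) for t in ts for s in snrs)
    print("worst per-term gap on grid:", worst, "bound:", PER_TERM_BOUND)
    print("total gap bound:", 0.71 + 4 * PER_TERM_BOUND)
```

At most four component terms appear in any cut, so the increase per cut is bounded by $4 \cdot \frac{1}{2\ln 2} = \frac{2}{\ln 2} \approx 2.885$ bits.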



Appendix C 
MDF Gap Analysis 

We investigate how close the MDF scheme performs to the upper bounds when $\Delta \neq 0$. First, the gap between the MDF scheme and the upper bound is calculated for the regions specified in Table I. Then, two special cases are considered.

General Case. We calculate the difference, named k, between the upper bounds and the rate offered by the MDF 
scheme from Eq. ( fTTl i for the cases shown in Table I (see Appendix |E|: 

²For the purpose of clarity, here the average powers are not set to unity.
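\[
\begin{aligned}
\kappa_1 &= \frac{-(C_{012}-C_{01})\Delta}{(C_{01}+C_{13})(C_{012}-C_{01}+C_{23})} + \delta, \\
\kappa_2 &= \frac{-(C_{012}-C_{02})\Delta}{(C_{02}+C_{23})(C_{012}-C_{02}+C_{13})} + \delta, \\
\kappa_3 &= \frac{(C_{123}-C_{13})\Delta}{(C_{01}+C_{13})(C_{123}-C_{13}+C_{02})} + \delta, \\
\kappa_4 &= \frac{(C_{123}-C_{23})\Delta}{(C_{02}+C_{23})(C_{123}-C_{23}+C_{01})} + \delta, \\
\kappa_5 &= \frac{-\Delta}{C_{01}+C_{13}}\left(\frac{C_{01}+C_{23}}{C_{02}+C_{23}} - \frac{C_{23}}{C_{012}-C_{01}+C_{23}}\right) + \delta, \\
\kappa_6 &= \frac{-\Delta}{C_{02}+C_{23}}\left(\frac{C_{02}+C_{13}}{C_{01}+C_{13}} - \frac{C_{13}}{C_{012}-C_{02}+C_{13}}\right) + \delta, \\
\kappa_7 &= \frac{\Delta}{C_{01}+C_{13}}\left(\frac{C_{02}+C_{13}}{C_{02}+C_{23}} - \frac{C_{02}}{C_{123}-C_{13}+C_{02}}\right) + \delta, \\
\kappa_8 &= \frac{\Delta}{C_{02}+C_{23}}\left(\frac{C_{01}+C_{23}}{C_{01}+C_{13}} - \frac{C_{01}}{C_{123}-C_{23}+C_{01}}\right) + \delta.
\end{aligned}
\]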



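Note that for the regions associated with $\kappa_7$ and $\kappa_8$ specified in Table I, $C_{123} \le C_{13} + C_{23}$ and hence, $\delta = 0$. To prove that $\kappa_i$ for $i \in \{1, \dots, 4\}$ are small, the following lemma is needed:

Lemma 7:
\[
C_{012} - \max\{C_{01}, C_{02}\} \le \frac{1}{2}, \qquad
C_{123} - \max\{C_{13}, C_{23}\} \le 1.
\]

Proof: See Appendix D-G. ■
For instance, $\kappa_1 \le \frac{1}{2} + \delta$ is proved as follows:

\[
\begin{aligned}
\kappa_1 &= \frac{(C_{13}C_{23} - C_{01}C_{02})(C_{012}-C_{01})}{(C_{01}+C_{13})(C_{012}-C_{01}+C_{23})} + \delta \\
&\overset{(a)}{\le} \frac{C_{13}C_{23}(C_{012}-C_{01})}{(C_{01}+C_{13})(C_{012}-C_{01}+C_{23})} + \delta \\
&\overset{(b)}{\le} \frac{1}{2} \cdot \frac{C_{13}C_{23}}{(C_{01}+C_{13})(C_{012}-C_{01}+C_{23})} + \delta \\
&= \frac{1}{2} \times \frac{C_{13}}{C_{01}+C_{13}} \times \frac{C_{23}}{C_{012}-C_{01}+C_{23}} + \delta \le \frac{1}{2} + \delta,
\end{aligned}
\]

where (a) comes from the fact that $-\Delta > 0$ for this case, so that dropping the non-negative term $C_{01}C_{02}$ only enlarges the numerator. According to the corresponding region shown in Table I, $C_{02} \le C_{01}$ and therefore (b) is true based on Lemma 7.

Lemmas 8 and 9 prove that $\kappa_5 \le \frac{1}{2} + \delta$ and $\kappa_7 \le 1$, respectively. The proof techniques can be easily adopted to correspondingly show that $\kappa_6 \le \frac{1}{2} + \delta$ and $\kappa_8 \le 1$.

Lemma 8: $\kappa_5 \le \frac{1}{2} + \delta$.

Proof: See Appendix D-H. ■
Lemma 9: $\kappa_7 \le 1$.

Proof: See Appendix D-I. ■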
Two special cases are also considered:

Symmetric Case. When $C_{01} = C_{02}$ and $C_{13} = C_{23}$, $\Gamma = \Gamma' = 0$ and it can be seen from Table I that the MDF scheme offers a data-rate that is, at most, $1 + \delta$ bits less than the corresponding upper bound.

Partially Symmetric Case. When either $C_{01} = C_{02}$ with $\Delta < 0$, or $C_{13} = C_{23}$ with $\Delta > 0$ occurs, it was seen in Section III-A that fully utilizing branch 1 or branch 2 gives the same achievable rate. Table I shows that in such cases, the gap is less than $1 + \delta$ bits.

Discussion. Multiplexing Gain (MG) of a scheme is defined in [24], [25] as:

\[
\text{MG} \triangleq \lim_{\text{SNR} \to \infty} \frac{R}{0.5\log(\text{SNR})},
\]

where R is the achievable rate of the scheme. Using Eq. (fTTT i. it can be shown that the MDF scheme achieves the 
multiplexing gain of 1. Avestimehr et al. proposed a broadcast multiple-access scheme for the full-duplex diamond channel and proved that the scheme is within one bit from the cut-set bound [21]. In the half-duplex case, the multiplexing gain of 1 is lost if this approach is followed, leading to an infinite gap between the achievable rate and the upper bound.

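It is easy to show that, for the remaining cases (shown in Table I), the gap can be large. For instance, suppose $C_{02} = x$, $C_{13} = C_{23} = \alpha x$ and $C_{01} = \beta x$, with $\alpha > \beta > 1$. In this case $\Delta < 0$ and $\Gamma \ge 0$, and therefore, the gap $\kappa$ is:

\[
\begin{aligned}
\kappa &= \frac{-\Delta}{C_{02}+C_{23}}\left(\frac{C_{02}+C_{13}}{C_{01}+C_{13}} - \frac{C_{13}}{C_{012}-C_{02}+C_{13}}\right) + \delta \\
&= \frac{-\Delta}{C_{02}+C_{23}}\left(\frac{C_{02}(C_{012}-C_{02}) + C_{13}(C_{012}-C_{01})}{(C_{01}+C_{13})(C_{012}-C_{02}+C_{13})}\right) + \delta \\
&\overset{(a)}{\ge} \frac{-\Delta}{C_{02}+C_{23}}\left(\frac{C_{02}(C_{012}-C_{02})}{(C_{01}+C_{13})(C_{012}-C_{02}+C_{13})}\right) + \delta \\
&\overset{(b)}{\ge} \frac{-\Delta}{C_{02}+C_{23}}\left(\frac{C_{02}(C_{01}-C_{02})}{(C_{01}+C_{13})(C_{012}-C_{02}+C_{13})}\right) + \delta \\
&\overset{(c)}{\ge} \frac{-\Delta}{C_{02}+C_{23}}\left(\frac{C_{02}(C_{01}-C_{02})}{(C_{01}+C_{13})^2}\right) + \delta \\
&\overset{(d)}{=} \frac{(\alpha^2-\beta)(\beta-1)}{(\alpha+\beta)^2(\alpha+1)}\,x + \delta,
\end{aligned}
\]

where in (a) the numerator is decreased by $C_{13}(C_{012}-C_{01})$. To obtain (b), $C_{012}$ in the numerator is replaced by the smaller quantity $C_{01}$. For (c), $C_{012}$ is substituted by the larger term $C_{01} + C_{02}$ in the denominator. In (d), the assumed values of the capacities in terms of $x$ are substituted. It is clear that the gap increases as $x$ becomes large. GDOF analysis of Appendix A also confirms that the MDF scheme can have a large gap from the upper bound.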
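
The linear growth of this lower bound in $x$ can be reproduced directly. The Python sketch below evaluates the expression after step (c) for the assumed capacities and compares it with the closed-form slope of step (d); the $\delta$ term is omitted, and the function names `mdf_gap_lower_bound` and `slope` are hypothetical.

```python
def mdf_gap_lower_bound(x, alpha, beta):
    """Expression after step (c), without delta, for C02 = x,
    C13 = C23 = alpha * x and C01 = beta * x."""
    c02, c13, c23, c01 = x, alpha * x, alpha * x, beta * x
    delta_cap = c01 * c02 - c13 * c23      # Delta (negative here)
    return (-delta_cap / (c02 + c23)) * c02 * (c01 - c02) / (c01 + c13) ** 2


def slope(alpha, beta):
    """Coefficient of x in step (d)."""
    return (alpha ** 2 - beta) * (beta - 1) / ((alpha + beta) ** 2 * (alpha + 1))


if __name__ == "__main__":
    alpha, beta = 3.0, 2.0                 # sample values with alpha > beta > 1
    for x in (1.0, 10.0, 100.0):
        print(x, mdf_gap_lower_bound(x, alpha, beta), slope(alpha, beta) * x)
```

For $\alpha = 3$ and $\beta = 2$ the slope is 0.07, so the MDF gap exceeds 7 bits once $x$ reaches 100.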

Appendix D 
Proofs 

In this appendix, the proofs of the lemmas used in this paper are provided. 
A. Proof of Lemma 1

We start with the fact that $C_{01} + C_{02} \ge C_{012}$. Rearranging the terms, and multiplying both sides of the inequality by $C_{13}$ give:

\[
C_{13}C_{02} \ge C_{13}(C_{012} - C_{01}).
\]

By adding $C_{02}(C_{012} - C_{02})$ to both sides and then dividing both sides by $C_{012} - C_{02} + C_{13}$, we obtain:

\[
C_{02} \ge \frac{C_{13}(C_{012}-C_{01}) + C_{02}(C_{012}-C_{02})}{C_{012}-C_{02}+C_{13}}.
\]

Assuming $C_{123} \le C_{13} + C_{23}$, we divide the Right Hand Side (RHS) by $C_{02} + C_{23}$ and the Left Hand Side (LHS) by the smaller quantity $C_{123} - C_{13} + C_{02}$ to achieve:

\[
\frac{C_{02}}{C_{123}-C_{13}+C_{02}} \ge \frac{C_{13}(C_{012}-C_{01}) + C_{02}(C_{012}-C_{02})}{(C_{012}-C_{02}+C_{13})(C_{02}+C_{23})} = \tau_4^*.
\]

This completes the proof.
This completes the proof. 
B. Proof of Lemma 2

\[
\tilde{R}^* - R^* = \frac{\delta C_{02}\big( (C_{02}+C_{13})(C_{012}-C_{02}+C_{13}) - C_{13}(C_{01}+C_{13}) \big)}{(C_{02}+C_{23})(C_{02}+C_{23}+\delta)(C_{012}-C_{02}+C_{13})}
\overset{(a)}{\le} \frac{\delta C_{02}^{2}}{(C_{02}+C_{23})^{2}} \le \delta,
\]

where in (a), the numerator is increased by replacing $C_{012} - C_{02}$ with $C_{01}$, using the fact that $C_{012} - C_{02} \le C_{01}$ (see Eq. (2)). In addition, the denominator is decreased by removing $\delta$.

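C. Proof of Lemma 3

\[
\begin{aligned}
\delta &= C_{123} - (C_{13} + C_{23}) \\
&= \frac{1}{2}\log\left(\frac{1 + g_{13} + g_{23} + 2\sqrt{g_{13}g_{23}}}{1 + g_{13} + g_{23} + g_{13}g_{23}}\right) \\
&\overset{(a)}{\le} \frac{1}{2}\log\left(1 + \frac{2\sqrt{g_{13}g_{23}} - g_{13}g_{23}}{1 + 2\sqrt{g_{13}g_{23}} + g_{13}g_{23}}\right) \\
&\overset{(b)}{\le} \frac{1}{2}\log\left(\frac{4}{3}\right),
\end{aligned}
\]

where in (a) the denominator is decreased by replacing $g_{13} + g_{23}$ with the smaller term $2\sqrt{g_{13}g_{23}}$. Defining $x \triangleq \sqrt{g_{13}g_{23}}$, it is easy to show that the maximum of $\log\big(1 + \frac{2x - x^2}{(1+x)^2}\big)$, for $0 \le x \le 2$, is attained at $x^* = \frac{1}{2}$, i.e., $g_{13}g_{23} = \frac{1}{4}$, which proves (b).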

D. Proof of Lemma 4

It is known that $C_{01}, C_{02} \le C_{012}$, which proves $0 \le C_{23}(C_{012} - C_{02})$ and $0 \le C_{01}(C_{012} - C_{01})$. Since both terms are non-negative, their sum is also non-negative, i.e., $0 \le C_{23}(C_{012} - C_{02}) + C_{01}(C_{012} - C_{01})$. By adding and subtracting $(C_{012} - C_{01} + C_{23})C_{13} + C_{01}C_{13}$, the inequality can be rearranged to:

\[
0 \le (C_{012}-C_{01}+C_{23})(C_{01}+C_{13}) + (C_{012}-C_{01})(C_{23}-C_{13}) - C_{23}(C_{02}+C_{13}).
\]

As mentioned earlier, Broadcast Mode is used for $\Delta < 0$, i.e., $C_{01}C_{02} < C_{13}C_{23}$. Therefore, both sides are multiplied by the positive term $-\Delta$ to acquire:

\[
0 \le (C_{13}C_{23} - C_{01}C_{02})\big( (C_{012}-C_{01}+C_{23})(C_{01}+C_{13}) + (C_{012}-C_{01})(C_{23}-C_{13}) - C_{23}(C_{02}+C_{13}) \big).
\]

Now, the positive term $(C_{012}-C_{01}+C_{23})(C_{012}-C_{01})(C_{01}+C_{13})^2$ can be added to the RHS of the inequality to achieve:

\[
\begin{aligned}
0 \le{} & (C_{13}C_{23} - C_{01}C_{02})\big( (C_{012}-C_{01}+C_{23})(C_{01}+C_{13}) + (C_{012}-C_{01})(C_{23}-C_{13}) - C_{23}(C_{02}+C_{13}) \big) \\
& + (C_{012}-C_{01}+C_{23})(C_{012}-C_{01})(C_{01}+C_{13})^2.
\end{aligned}
\]

The above inequality can be equivalently stated as:

\[
\begin{aligned}
& (C_{13}C_{23} - C_{01}C_{02})\big( (C_{012}-C_{01}+C_{23})(C_{13}-C_{23}) + C_{23}(C_{02}+C_{23}) \big) \\
& \quad + C_{01}(C_{02}+C_{23})(C_{01}+C_{13})(C_{012}-C_{01}+C_{23}) \le (C_{012}-C_{01}+C_{23})^2 (C_{01}+C_{13})^2.
\end{aligned}
\]

Since $1 \le C_{01}$, the LHS becomes smaller if $C_{01}(C_{02}+C_{23})$ is replaced by $(C_{02}+C_{23})$, leading to:

\[
\begin{aligned}
& (C_{13}C_{23} - C_{01}C_{02})\big( (C_{012}-C_{01}+C_{23})(C_{13}-C_{23}) + C_{23}(C_{02}+C_{23}) \big) \\
& \quad + (C_{02}+C_{23})(C_{01}+C_{13})(C_{012}-C_{01}+C_{23}) \le (C_{012}-C_{01}+C_{23})^2 (C_{01}+C_{13})^2.
\end{aligned}
\]

Now as $C_1 \le \frac{1}{2}$ (see Eq. (24)), the following inequality is also true:

\[
\begin{aligned}
2C_1 \Big\{ & (C_{13}C_{23} - C_{01}C_{02})\big( (C_{012}-C_{01}+C_{23})(C_{13}-C_{23}) + C_{23}(C_{02}+C_{23}) \big) \\
& + (C_{02}+C_{23})(C_{01}+C_{13})(C_{012}-C_{01}+C_{23}) \Big\} \le (C_{012}-C_{01}+C_{23})^2 (C_{01}+C_{13})^2.
\end{aligned}
\]

By rearranging the preceding inequality,

\[
\frac{C_1 (C_{13}C_{23} - C_{01}C_{02})\big( (C_{012}-C_{01}+C_{23})(C_{13}-C_{23}) + C_{23}(C_{02}+C_{23}) \big)}{(C_{01}+C_{13})(C_{012}-C_{01}+C_{23})\big( (C_{01}+C_{13})(C_{012}-C_{01}+C_{23}) - C_1(C_{02}+C_{23}) \big)} \le \frac{1}{2},
\]

which completes the proof.
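E. Proof of Lemma 5

The optimization (33) is an LP and, together with the multiple-access constraints (6), can be written as follows:

\[
\begin{aligned}
\text{maximize} \quad & R_{\text{MAC}} \\
\text{subject to:} \quad & R_{\text{MAC}} \le t_2 C_{01} + t_3 C_{02} \\
& R_{\text{MAC}} - R_1 \le t_3 (C_{02} + C_{13}) \\
& R_{\text{MAC}} - R_2 \le t_2 (C_{01} + C_{23}) \\
& R_{\text{MAC}} - (R_1 + R_2) \le t_2 C_{23} + t_3 C_{13} \\
& R_1 \le t_4 C_{13} \\
& R_2 \le t_4 C_{23} \\
& R_1 + R_2 \le t_4 C_{\text{MAC}} \\
& \textstyle\sum_{i=2}^{4} t_i = 1, \quad t_i \ge 0.
\end{aligned}
\]

Using Fourier-Motzkin elimination [27], the LP can be equivalently stated as:

\[
\begin{aligned}
\text{maximize} \quad & R_{\text{MAC}} \\
\text{subject to:} \quad & R_{\text{MAC}} \le t_2 C_{01} + t_3 C_{02} && (43) \\
& R_{\text{MAC}} \le t_3 (C_{02} + C_{13}) + t_4 C_{13} && (44) \\
& R_{\text{MAC}} \le t_2 (C_{01} + C_{23}) + t_4 C_{23} && (45) \\
& R_{\text{MAC}} \le t_2 C_{23} + t_3 C_{13} + t_4 C_{\text{MAC}} && (46) \\
& R_{\text{MAC}} \le t_2 C_{23} + t_3 C_{13} + t_4 (C_{13} + C_{23}) && (47) \\
& 2R_{\text{MAC}} \le t_2 (C_{01} + C_{23}) + t_3 (C_{02} + C_{13}) + t_4 C_{\text{MAC}} && (48) \\
& 2R_{\text{MAC}} \le t_2 C_{23} + t_3 (C_{02} + 2C_{13}) + t_4 (C_{13} + C_{\text{MAC}}) && (49) \\
& \textstyle\sum_{i=2}^{4} t_i = 1, \quad t_i \ge 0. && (50)
\end{aligned}
\]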



Now, it is shown that inequalities (47)-(49) are redundant. First, since $C_{\text{MAC}} \le (C_{13} + C_{23})$, the RHS of inequality (47) is greater than the RHS of inequality (46). Therefore, inequality (47) is redundant. Second, inequalities (48) and (49) are simply obtained by adding inequalities (43), (46) and (44), (46), respectively. Therefore, the following LP gives the maximum achievable rate of this scheme:

\[
\begin{aligned}
\text{maximize} \quad & R_{\text{MAC}} \\
\text{subject to:} \quad & R_{\text{MAC}} \le t_2 C_{01} + t_3 C_{02} && (51) \\
& R_{\text{MAC}} \le t_3 (C_{02} + C_{13}) + t_4 C_{13} && (52) \\
& R_{\text{MAC}} \le t_2 (C_{01} + C_{23}) + t_4 C_{23} && (53) \\
& R_{\text{MAC}} \le t_2 C_{23} + t_3 C_{13} + t_4 C_{\text{MAC}} && (54) \\
& \textstyle\sum_{i=2}^{4} t_i = 1, \quad t_i \ge 0. && (55)
\end{aligned}
\]

Instead of solving the above LP, a feasible solution that satisfies all the constraints is found. This solution is not necessarily optimum; however, it provides us with an achievable rate. For $\Gamma' \le 0$, inequalities (51), (52), and (54) are set to equalities, leading to:

\[
\begin{aligned}
t_2 &= \frac{C_{13}}{C_{01}+C_{13}}, \\
t_3 &= \frac{C_{01}(C_{\text{MAC}}-C_{13}) + C_{13}C_{23}}{(C_{01}+C_{13})(C_{\text{MAC}}-C_{13}+C_{02})}, \\
t_4 &= \frac{\Delta}{(C_{01}+C_{13})(C_{\text{MAC}}-C_{13}+C_{02})}, \\
R^{1}_{\text{MDF-MAC}} &= \frac{C_{01}(C_{02}+C_{13})}{C_{01}+C_{13}} - \frac{C_{02}\Delta}{(C_{01}+C_{13})(C_{\text{MAC}}-C_{13}+C_{02})}.
\end{aligned} \tag{56}
\]

To ensure that the above results are valid, the inequality (53) has to be satisfied. Considering inequalities (51) and (53), it is sufficient to show that $t_3 C_{02} \le (t_2 + t_4) C_{23}$. Using the values obtained in Eq. (56), this is equivalent to proving:

\[
C_{02}\big(C_{01}(C_{\text{MAC}}-C_{13}) + C_{13}C_{23}\big) \le C_{23}\big(\Delta + C_{13}(C_{\text{MAC}}-C_{13}+C_{02})\big).
\]

By re-ordering the terms and using the definition of $\Delta$, the above inequality can be alternatively written as:

\[
C_{\text{MAC}}\Delta \le (C_{13} + C_{23})\Delta,
\]

which is true since $\Delta \ge 0$ and $C_{\text{MAC}} = C(g_{13} + g_{23}) \le C_{13} + C_{23}$.

For $\Gamma' \ge 0$, inequalities (51), (53), and (54) are set to equality. In this case, the time intervals and the achievable rate become:

\[
\begin{aligned}
t_2 &= \frac{C_{02}(C_{\text{MAC}}-C_{23}) + C_{13}C_{23}}{(C_{02}+C_{23})(C_{\text{MAC}}-C_{23}+C_{01})}, \\
t_3 &= \frac{C_{23}}{C_{02}+C_{23}}, \\
t_4 &= \frac{\Delta}{(C_{02}+C_{23})(C_{\text{MAC}}-C_{23}+C_{01})}, \\
R^{2}_{\text{MDF-MAC}} &= \frac{C_{02}(C_{01}+C_{23})}{C_{02}+C_{23}} - \frac{C_{01}\Delta}{(C_{02}+C_{23})(C_{\text{MAC}}-C_{23}+C_{01})}.
\end{aligned} \tag{57}
\]
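
The feasibility argument for the $\Gamma' \le 0$ schedule can be illustrated numerically. The Python sketch below builds the schedule of (56) from channel gains and returns the right-hand sides of constraints (51)-(54), so that the three tight constraints and the slack one can be inspected. The names `cap` and `mdf_mac_case1` and the sample gains are illustrative assumptions; base-2 logarithms are assumed.

```python
import math


def cap(x):
    """C(x) = 0.5 * log2(1 + x)."""
    return 0.5 * math.log2(1.0 + x)


def mdf_mac_case1(g01, g02, g13, g23):
    """Schedule and rate of Eq. (56), intended for Delta >= 0, Gamma' <= 0."""
    c01, c02, c13, c23 = cap(g01), cap(g02), cap(g13), cap(g23)
    c_mac = cap(g13 + g23)
    delta = c01 * c02 - c13 * c23
    d = (c01 + c13) * (c_mac - c13 + c02)
    t2 = c13 / (c01 + c13)
    t3 = (c01 * (c_mac - c13) + c13 * c23) / d
    t4 = delta / d
    rate = c01 * (c02 + c13) / (c01 + c13) - c02 * delta / d
    rhs = {
        51: t2 * c01 + t3 * c02,
        52: t3 * (c02 + c13) + t4 * c13,
        53: t2 * (c01 + c23) + t4 * c23,
        54: t2 * c23 + t3 * c13 + t4 * c_mac,
    }
    return (t2, t3, t4), rate, rhs


if __name__ == "__main__":
    # Sample gains: strong first hop (Delta > 0) and g13 < g23 (Gamma' < 0).
    times, rate, rhs = mdf_mac_case1(1000.0, 1000.0, 10.0, 20.0)
    print("t2, t3, t4:", times, "sum:", sum(times))
    print("rate:", rate)
    print("constraint right-hand sides:", rhs)
```

In this example, constraints (51), (52) and (54) are met with equality and (53) holds with slack, as the proof requires. The $\Gamma' \ge 0$ schedule of (57) follows by swapping the relay indices.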
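F. Proof of Lemma 6

\[
\begin{aligned}
C_{123} - C_{\text{MAC}} &= \frac{1}{2}\log\left(\frac{1 + (\sqrt{g_{13}} + \sqrt{g_{23}})^2}{1 + g_{13} + g_{23}}\right) \\
&= \frac{1}{2}\log\left(1 + \frac{2\sqrt{g_{13}g_{23}}}{1 + g_{13} + g_{23}}\right) \\
&\le \frac{1}{2}\log\left(1 + \frac{g_{13} + g_{23}}{1 + g_{13} + g_{23}}\right) \\
&\le \frac{1}{2}.
\end{aligned}
\]

G. Proof of Lemma 7

\[
\begin{aligned}
C_{012} - \max\{C_{01}, C_{02}\} &= \frac{1}{2}\log\left(\frac{1 + g_{01} + g_{02}}{1 + \max\{g_{01}, g_{02}\}}\right) \\
&\le \frac{1}{2}\log\left(1 + \frac{\max\{g_{01}, g_{02}\}}{1 + \max\{g_{01}, g_{02}\}}\right) \\
&\le \frac{1}{2}.
\end{aligned}
\]

\[
\begin{aligned}
C_{123} - \max\{C_{13}, C_{23}\} &= \frac{1}{2}\log\left(\frac{1 + (\sqrt{g_{13}} + \sqrt{g_{23}})^2}{1 + \max\{g_{13}, g_{23}\}}\right) \\
&= \frac{1}{2}\log\left(1 + \frac{\min\{g_{13}, g_{23}\} + 2\sqrt{g_{13}g_{23}}}{1 + \max\{g_{13}, g_{23}\}}\right) \\
&\le \frac{1}{2}\log\left(1 + \frac{3\max\{g_{13}, g_{23}\}}{1 + \max\{g_{13}, g_{23}\}}\right) \\
&\le \frac{1}{2}\log(1 + 3) \\
&= 1.
\end{aligned}
\]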



H. Proof of Lemma 8

In this region, $C_{01} \le 1$ and $C_{01} \le C_{02}$; therefore, $0 \le C_{13}C_{23}(C_{02} - C_{01})(1 - C_{01})$. It is easy to verify that the following inequality is valid:

\[
2C_{13}C_{23}\big(C_{01}(C_{02}-C_{01}) + 0.5(C_{01}+C_{23})\big) \le (C_{01}+C_{13})(C_{02}+C_{23})\big(C_{23} + 0.5 + C_{01}(C_{02}-C_{01})\big). \tag{58}
\]

Replacing $C_{13}C_{23}$ by the smaller quantity $(C_{13}C_{23} - C_{01}C_{02})$ in the LHS of the above inequality results in:

\[
2(C_{13}C_{23} - C_{01}C_{02})\big(C_{01}(C_{02}-C_{01}) + 0.5(C_{01}+C_{23})\big) \le (C_{01}+C_{13})(C_{02}+C_{23})\big(C_{23} + 0.5 + C_{01}(C_{02}-C_{01})\big). \tag{59}
\]

Since $C_{01} \le 1$, the term $C_{01}(C_{02} - C_{01})$ in the RHS can be substituted by the larger term $(C_{02} - C_{01})$. Hence, the following inequality is true:

\[
-2\Delta\big(C_{01}(C_{02}-C_{01}) + 0.5(C_{01}+C_{23})\big) \le (C_{01}+C_{13})(C_{02}+C_{23})\big(C_{23} + 0.5 + (C_{02}-C_{01})\big). \tag{60}
\]

Rearranging the terms leads to:

\[
\frac{-\Delta}{C_{01}+C_{13}}\left(\frac{C_{01}+C_{23}}{C_{02}+C_{23}} - \frac{C_{23}}{C_{02} + 0.5 - C_{01} + C_{23}}\right) \le \frac{1}{2}. \tag{61}
\]

According to Lemma 7, $C_{012} \le C_{02} + 0.5$ in this region. Replacing $C_{02} + 0.5$ with the smaller term $C_{012}$ enlarges the subtracted fraction, so the LHS of (61) can only decrease. Therefore:

\[
\frac{-\Delta}{C_{01}+C_{13}}\left(\frac{C_{01}+C_{23}}{C_{02}+C_{23}} - \frac{C_{23}}{C_{012} - C_{01} + C_{23}}\right) \le \frac{1}{2}, \tag{62}
\]

which completes the proof.



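I. Proof of Lemma 9

\[
\begin{aligned}
\kappa_7 &= \frac{\Delta}{C_{01}+C_{13}}\left(\frac{C_{02}+C_{13}}{C_{02}+C_{23}} - \frac{C_{02}}{C_{123}-C_{13}+C_{02}}\right) + \delta \\
&\overset{(a)}{\le} \frac{\Delta}{C_{01}+C_{13}} \times \frac{C_{13}}{C_{02}+C_{23}} + \delta \\
&\overset{(b)}{\le} \frac{\Delta}{(C_{01}+C_{13})(C_{02}+C_{23})} + \delta \\
&\overset{(c)}{\le} \frac{C_{01}}{C_{01}+C_{13}} \times \frac{C_{02}}{C_{02}+C_{23}} + \delta \\
&\le 1 + \delta.
\end{aligned}
\]

As $C_{123} \le C_{13} + C_{23}$ in this region, $C_{123} - C_{13}$ is replaced by the larger quantity $C_{23}$ to obtain (a). (b) is valid since $C_{13} \le 1$ for this scenario. In (c), $\Delta$ is substituted by the larger term $C_{01}C_{02}$. Since $\delta = 0$ in this region, $\kappa_7 \le 1$ follows.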



Appendix E 
Gap Analysis Summary 

The results related to gap analysis are compactly shown in Table I. For each region specified by some conditions on the link capacities, the corresponding symbols for the upper bound, the achievable rate, and the gap (i.e., the difference between the upper bound and the achievable rate) are shown. In addition, an upper bound on the value of the gap is given. For instance, for the region specified by the $\Delta < 0$, $\Gamma \le 0$, and $C_{02} \le C_{01}$ conditions, the upper bound, the achievable rate, and the gap are respectively represented by $R_{\text{up}}^{1}$, $R^{1}_{\text{MDF}}$, and $\kappa_1$. Using the achievable scheme that leads to $R^{1}_{\text{MDF}}$, the gap from the upper bound $R_{\text{up}}^{1}$ is less than $\frac{1}{2} + \delta$. Our results, summarized in Table I, indicate that sending independent information during each mode together with the decode-and-forward scheme are sufficient to operate close to the capacity of the channel.
TABLE I